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FOREWORD 


This  report  discusses  the  experimental  procedures  used  and 
results  obtained  during  work  under  Contract  DA-18-001-AMC-954 (X) 
entitled,  ''The  Detection  and  Identification  of  Chemical  Signatures" 
Personnel  who  contributed  to  this  program  include, 

B.K.  Krotoszynski .  A.  Dravnieks,  Wanda  Bahmet ,  H.W.  Neumann, 

N.S.  Snaw,  T.A.  Stanley,  and  Joyce  Burton. 


Respectfully  submitted, 
I IT  Research  Institute 


Head,  Olfactronics  and 
Odor  Science  Center 


JL 


B.K.  Krotoszynski 
Senior  Chemist 


Delivered  by  1IT  Research  Institute,  pursuant  to  Contract  No. 
DA-18-001-AMC-954 (X) .  Government's  use  controlled  by  the 
provisions  of  Articles  51  and  26  of  Title  II  of  the  contract 
which  are  ASPR  9-107.2  and  5-203.4,  respectively. 
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DETECTION  AND  IDENTIFICATION  OF  CHEMICAL  SIGNATURES 

ABSTRACT 

In  the  work  on  the  detect,  ion  and  ident  if  i  cat  ion  of  human 
chemical  srgnatui  es  experimental  dtia  computei  lZed  statistical 
techniques  were  fiiat  developed  tested,  and  standardized  The 
experimental  techniques  solved  the  problems  of  sample  collection., 
recovery  injection  into  the  gas  chromatograph,  calibration  and 
character izat ion  of  signature  components.  The  sample  collections 
were  based  on  the  principle  of  equilibration  and  were  performed 
by  means  of  Apiezon  L-coated  Teflon  powder  in  the  form  of  a 
fluidized  bed.  The  collected  samples  were  transferred  into  a 
special  injector  needle  by  means  of  an  IITRI-designed  transfer 
system.  The  injection  of  samples  into  the  gas  chromatograph  was 
performed  in  a  timed,  automatic  and  reproducible  manner  with  a 
specially  constructed  injection  system.  The  calibration  method 
permitted  correlation  of  the  concentrations  of  known  signature 
components  in  air  with  their  respective  peak  areas.  The 
sensitivity  of  this  novel  process  is  of  the  order  ol  one  part 
per  billion  (by  volume)  of  an  organic  component  in  air. 

The  developed  techniques  were  used  tc  collect  and  analyze 
vapor  samples  from  humans  temporarily  isolated  from  the  environment 
by  being  placed  in  a  speciaL  glass  cell  containing  high  purity 
air.  The  glass  cell  was  fitted  with  low-  ad sorq  t i ve  Teflon 
gaskets  and  a  Teflon  coated  resting  platform.  The  gas  chromato¬ 
graphic  blanks  of  air  obtained  from,  the  system  contained  a  few 
small  size  peaks  m  the  high- volat llity  range  of  the  chromatograph 
when  analyzed  by  the  developed  process- 

In  65  human  chemical  signatures  collected  and  analyzed  in 
the  packed  10  ft  Carbowax  20M  column  at  125 °C ,  32  to  48  peaks  per 
signature  were  observed.  Statistical  pattern  analysis  using 
stepwise  discriminant  analysis  on  these  human  chemical  signatures 
which  included  blocks  of  Caucasian  males,  Indian  (from  India)  males, 
Caucasian  males,  and  on  10  chemical  signatures  of  assorted 
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environments  showed  that  valid  discriminant  functions  can  be 
obtained  utilizing  5  to  15  of  the  chromatographic  ranges. 

These  findings  indicated  that  sour ce- type- characteristic  features 
exist  in  the  moderately  resolved  composition  of  vapors  from  humans 
obtained  gas  chro.matographically .  These  features  can  be  used  as 
a  basis  for  detection  and  identification  processes. 

Higher  resolution  gas  chromatography  at  several  temperature 
levels  in  a  200-ft  support- coated  open  tubular  Carbowax  20M  column 
exhibited  typically  100  to  130  peaks.  Twenty-five  of  these, 
located  m  the  gas  chromatogr aphic  ranges  where  peaks  were  most 
frequently  observed  in  the  lower - resolut ion  human  chemical 
signatures,  were  re- analyzed  in  a  specially  designed  two- column 
gas  chromatograph  in  a  50- ft  Apiezon  L  column.  Each  one  of  the 
25  peaks  consisted  of  3  to  10  chemical  components,  or  more  than 
150  components  (total)  when  examined  in  the  two  column  apparatus, 

The  polarity  of  the  components  indicated  the  presence  of  considerable 
amounts  of  hydrocarbons-  carbonyl  compounds,  and  others,  such  as 
alcohols.  Other  materials,  exceeding  the  alcohols  in  their  gas 
chromatographic  polarity,  are  probably  bif unctional  compounds  or 
those  with  a  functional  group  and  more  than  one  double  bond  in  the 
molecule.  Analysis  of  collected  samples  in  a  specially  designed 
three-column  gas  chromatograph  utilizing  computerized  data 
interpretation  similarly  indicated  a  great  complexity  of 
composition ■ 

Mass  spectrographic  identification  of  the  components,  with  a 
few  exceptions  (moth- proof ing  compounds  were  found)  was  not 
successful.  The  analysis  with  the  two-column  gas  chromatograph 
demonstrated  t  lat  the  principal  reason  for  this  difficulty  was  the 
lack  of  resolution,  even  by  analysis  using  a  single  200-ft  tubular 
column.  Satisfactory  future  identification  should  be  possible 
using  the  resolved  output  from  the  combination  200-ft  Carbowax 
20M/50-ft  Apiezon  L  columns  in  series. 

The  raw  data  were  tabulated  in  terms  of  Kovats  retention 
index  and  are  listed  in  the  Appendexes. 
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1 ,0 _ INTRODUCTION 

In  the  previous  project,  "Detection  and  Identification  of 
Chemical  Signatures."  Contract  Dk  -18-00 1~ AMC-954  (X)  ,  the 
following  work  was  conducted-'  (1)  a  glass  cell  apparatus  was 
designed  and  constructed  for  temporary  isolation  of  human 
subjects  in  a  stream  of  hiqhly  puiified  air  (2)  a  vapor 
collection,  sample  txansfer,  and  gas  chr omatographic  procedures 
were  developed  for  obtaining  a  gas  chromatogr aphic  representation 
of  organic  components  in  the  effluents  from  the  temporary  isolated 
subjects,  and  (3)  several  repi esentative  gas  chromatograms  of 
several  subjects  of  different  races  and  both  sexes  were  obtained- 

The  objective  of  the  present  project  was  (1)  tc  further 
improve  the  vapor  collection  and  evaluation  techniques,  (2)  obtain 
a  sufficient  number  of  chemical  signatures  (gas  chromatographic 
representations)  of  vapors  from  a  variety  of  subjects  so  that  some 
generalizations  would  become  possible  (3)  to  characterize,  as 
much  as  possible  ■  those  human  vaporous  effluent  components  that 
may  be  most-  suitable  for  identification  of  human  vapors  in 
various  environments,  and  (4)  to  prognosticate  on  the  most 
promising  approaches  to  the  effluent  composition  characterization 
processes  that  might  be  suitable  for  identification  of  human- 
related  vaporous  components. 

As  the  work  progr*--  sed  ,  the  variability  of  the  chemical 
signatures  among  the  l  -lviduals  and  the  complexity  of  the  vapor 
composition  emerged  as  the  dominant  features  complicating  the 
tasks  (3)  and  (4)  of  the  objectives  Consequently,  much  effort 
had  to  be  spent  on  techniques  of  data  obtaining  and  interpretation, 
to  facilitate  the  tasks  (3)  and  (4)-  The  work  consisted  of 
considerable  improvement  in  the  techniques  in  statistical  pattern 
analysis  to  direct  attention  to  these  gas  chromatographic  features 
which  are  most  human-related  and  nigh-resolution  gas  chromatography 
and  some  mass  spectr egrapny  to  better  characterize  the  chemical 
nature  of  the  oiganic  fractions  of  the  effluents  from  humans. 
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2.0 _ S ELECTION  OF  EXPER IM ENTAL  APP ROACHES 

The  previous  work  indicated  that  human  chemical  signatures 
vary  widely  among  the  individuals  and  to  a  considerable  extent 
in  the  same  individual  and  contain  many  organic  components. 

The  volatility  of  the  components  encountered  covered  a  broad 
range,  beginning  with  volatile  substances  apparently  belonging 
to  the  lowest  members  of  the  homologous  series  and  expected  to 
occur  in  most  atmospheres  and  extending  to  substances  with  much 
lower  volatility  such  as  expected  from  organic  substances  with 
ten  or  more  carbon  atoms  m  molecule.  It  became  obvious  that  the 
human-character  components  may  easily  occur  in  the  heavier  (lower 
volatility)  sections  of  gas  chromatographic  spectrum  and  that  the 
chromatographic  peaks  observed  most  likely  consist  cf  more  than 
one  substance.  At  that  juncture  two  courses  were  possible: 

(1)  work  toward  improvement  of  gas  chromatographic  resolution 
with  detailed  study  of  components  encountered  in  a  few 
individuals,  or  (2)  survey  of  chemical  signatures  of  many 
individuals  and  some  environments  with  intention  to  locate  those 
gas  chromatographic  features  which  would  be  most  suitable  for 
discrimination  of  humans  vs  environments  so  that  the  later 
higher-resolution  studies  could  be  concentrated  on  the  significantly 
discriminating  ranges. 

The  second  of  these  routes  was  selected,  to  avo:d  the 
possibility  of  wasting  initial  efforts  on  identification  of  those 
components  which  later  might  be  found  not  to  be  emitted  by  many 
humans  or  fce  not  particularly  suitable  for  detection  of  vapors 
from  humans  in  environments  Since  the  chemical  signatures 
extended  into  the  range  of  heavier  substances,  the  air  sampling 
technique  had  to  be  modified  before  data  accumulation  could  begin 
to  permit  sufficient  collection  of  substances  with  lower  vapor 
pressures . 

The  above  considerations  guided  the  experimental  approach 
through  the  following  stages: 
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Fluidized  bed  collector  method  with  suitable  vapor 
transfer  devices  and  calibration  apparatus  were  developed 
to  permit  systematic  and  reproducible  chemical  signature 
collection  and  transfer  to  the  gas  chromatograph. 

2.  After  preliminary  experimentation  with  the  fluidized  bed 
collectors  for  human  chemical  signature  collection, 
standan  zed  samples,  of  organic  sapors  were  obtained  m 
65  samplings  from  individuals  of  both  sexes  and  several 
races  (with  some  individuals  repeated)  and  in  10 
samplings  from  several  environments. 

3.  The  collected  samples  were  analyzed  using  iQ-ft  packed 
bed  Carbowax  20M  column  at  125 °C,  obtaining  75  gas 
chromatographic  patterns. 

4.  The  patterns  were  submitted  to  computerized  statistical 
analysis,  to  detect  those  features  that  are  most  suitable 
fcr  discrimination  of  different  vapor  source  classes 
e.g. ,  humans  vs  environment,  different  groups  of  humans 
(males  vs  females  Indian  males  vs  Caucasian  males,  etc.). 
Several  statistical  approaches  and  rationales  were  used, 
and  resulted  in  pointing  out  those  gas  chromatographic 
ranges  which  contained  components  potentially  useful 

for  source  class  discrimination. 

5.  Several  methods  of  high- resolution  gas  chromatography 
were  developed  to  increase  the  component  resolution  to 

a  degree  where  mass  spectrographic  analysis  of  individual 
components  would  be  possible.  These  methods  progressed 
through:  50-ft  and  200-ft  support- coated  open  tubular 
single-column  gas  chromatograph;  a  two -column 
chromatograph  where  peaks  from  the  first  high-resolution 
polar  column  could  be  individually  injected  into  a 
second ,  non-polar  column:  and  a  three-column  gas 
chromatograph  where  the  entire  pattern  co;  Id  be  obtained 
in  polar  and  non-polar  columns  and  correlated  using 
specifically  written  computerized  peak-matching  program. 
Because  of  the  high  complexity  of  the  sample,  the  three- 
column  patterns  lost  considerable  detail  and  appeared  to 
present  principally  the  gross  composition  of  the  chemical 
signature.  The  two-column  device  was  judged  to  yield 
most  information,  and  the  chemical  signature  of  one 
individual  was  studied  in  detail  by  repeated  analyses 
with  this  device,  inspecting  the  ranges  indicated 
significant  by  the  statistical  analysis. 

6.  The  initial  attempts  to  mass  spectrographically  identify 
the  components  after  resolution  in  a  50-ft  support- 
coated  open  tubular  Cart. .wax  20M  column  did  not  yield, 
with  one  exception,  clearly  identifiable  components. 

Later  work  with  similar  single  200-ft  Carbowax  20M 
column  and  with  the  two- column  device  indicated  that 
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even  the  peaks  emerging  from  the  200-ft  colunin  were 
further  separable  into  2  to  as  many  as  10  components. 

This  explained  the  difficulty  experienced  in  the  earlier 
mass  spectrographic  attempts  of  the  component  identification 
and  pointed  the  way,  in  the  form  of  the  two-column  device, 
for  obtaining  gas  chromatographic  resolution  sufficient 
for  more  successful  mass  spectrographic  identification. 

7.  An  initial  listing  system  for  classification  of  the 
components  of  the  chemical  signatures  in  terms  of 
retention  (Kovats)  indexes  was  adapted,  and  some 
considerations  for  chemical  signature  detector  devices 
were  evolved. 
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3.0  EXPERIMENTAL 


The  various  subsections  of  this  chapter  deal  with  the 
chemical  signature  collection  and  vapor  transfer  techniques, 
calibration,  subject  selection,  statistical  pattern  analysis 
experiments,  and  several  techniques  for  high-resolution  analysis 
of  the  collected  samples. 

3.1  Glass  Cell  Apparatus 

The  lS~in  I.D.  cylindrical  glass  cell,  with  Teflon  ring 
seals,  Teflon-coated  end  plate,  and  a  Teflon-coated  steel  frame 
stretcher  for  the  support  of  human  subjects  in  the  supine 
position  in  the  middle  of  the  cell  has  been  described  in  detail 
in  the  previous  report.* 

For  the  chemical  signature  collection,  the  subjects  were 
enclosed  in  the  cell  for  45-min  in  a  stream  of  highly  purified 
air.  The  air  was  taken  from  hospital  grade  cylinders.  Experience 
indicated  that  the  commercially  available  hospital  grade  air 
contained  organic  impurities  in  varying  concentrations  sufficient 
to  make  the  analysis  of  the  vapors  emitted  by  humans  uncertain  and 
cumbersome. 

Experiments  were  conducted  to  design  purification  procedure 
that  would  reduce  the  contaminant  level  in  the  air  fed  to  the 
cell  to  a  practical  minimum.  Air  first  passes  through  a  particle 
filter  (Vetricel,  Type  VF-6,47mm  Dia ,  Pore  size.45U.  Gelman 
Instrument  Company,  Ann  Arbor,  Michigan)  then  in  downflow  through  an 
activated  charcoal  bed  (Figure  i)  cooled  in  its  lowest  one  third 
section  by  a  dry-ice-propanol  mixture.  The  gradual  cooling  of  air 
in  the  charcoal  bed  prevents  aerosol  formation,  since  vapors 
adsorb  at  approach  to  their  dew  point  and  before  an  overcooling 
that  can  lead  to  aerosol  formation  occurs.  The  diameter  of  the 
carbon  bed  was  4-ir.  and  the  rate  of.  air  flow  0.75  1/sec.  The 


* Re port  IITRI  No.  U5012-4 ,  Contract  DA-11-022-AMC-1775 (X) 
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charcoal  bed  adsorber  was  activated  and  stored  at  200 °C  in 
nitrogen  stream  between  uses,  so  that  each  chemical  signature 
collection  experiment  began  using  a  freshly  reactivated  adsorber. 

The  purified  air  entered  the  glass  cell  at  very  low  humidity 
level,  but  sufficient  humidity  was  picked  up  from  human  subject 
to  result  in  40-50  percent  relative  humidity  at  the  exit  of 
the  cell.  The  exit  air  temperature  typically  was  23°  to  25°C. 

Higher  humidities  for  which  provisions  were  made  in  the  initial 
design  of  the  air  preparation  train  were  avoided,  since  the 
subjects  tended  to  begin  perspiring  profusely,  and  the  excess 
humidity  resulted  in  clouding  the  glass  and  in  difficulties  with 
water  condensation  in  the  vapor  collectors  kept  at  room  temperature. 

The  content  of  oxygen  in  the  air  decreased  by  less  than 
2  percent  absolute  in  passage  through  the  cell  when  it  was 
occupied  by  a  subject. 

3.2  Fluidized  Bed  Collectors 

The  organic  vapor  collection  was  conducted  using  the  principle 
of  absorption  in  highly  purified  grease  films.  This  principle, 
used  in  gas  chromatography  in  the  form  of  "pre-column"  collectors, 
permits  collection  of  organic  materials  from  vapor  phase  without 
interfering  accumulation  of  water  vapor,  if  non-polar  greases 
are  used.  However,  in  its  usual  form  of  a  packed  bed, consisting 
of  grease  or  oil -coated  inert  powders,  it  has  certain  disadvantages: 
the  vapors  of  substances  that  are  more  volatile  and  less  soluble 
in  the  grease  can  saturate  the  collector  upon  passage  of  certain 
limited  air  volume  and  the  excess  begins  to  pass  through  the 
collector,  while  the  less  volatile  or  more  oil-soluble  vapor 
components  may  still  continue  to  be  fully  removed  by  the  collector 
from  the  air  passing  through  the  collector.  Therefore,  at  any 
given  volume  of  air  passed,  some  vapor  components  will  reach  the 
equilibration  concentration  and  their  amount  in  the  collector  will 
not  further  increase,  regardless  how  much  more  air  is  passed. 

Other  components  will  continue  to  accumulate  until  their  specific 
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equilibration  conditions  will  be  met. 

The  maximum  amount  of  a  substance  that  can  be  collected  from 
its  vapors  in  air  is  determined  by  several  physicoche .deal 
variables  combined  as  follows: 

The  number  of  molecules  of  certain  species  collected  is 

n  6. 1023  d  ,  1  .  |la  ,  v  (I) 

M  t  n0 

Here  d  amd  M  are  the  density  and  the  molecular  weight  of  the 
film  substance,  respectively;  7T  is  the  activity  coefficient  of 
the  species  in  the  film;  n  is  the  concentration, molecules/cm3 , 

O 

of  the  species  in  the  film;  nn  is  the  concentration  in  gas  at 

u  3 

saturation  at  the  same  temperature;  and  V  is  the  volume,  cm  ,  of 
the  film  phase  in  the  sampler. 

The  activity  coefficient  is  large  for  cases  when  the  vapor 
is  polar  while  the  collecting  phase  is  non-polar.  For  the  same 
vapor  and  collecting  phase,  the  amount  collected  is  proportional 
to  the  concentration  of  this  vapor  in  air  (n  )  if  the  collector 

d 

temper ature  and  consequently  nQ  are  maintained  constant. 

For  convenience,  it  can  be  pointed  out  that  n/n  represents 

3  a 

the  volume  of  air,  in  cm  .  from  which  the  particular  vapor  can  be 
fully  extracted,  and  by  rearranging  Eq,  1, 

p.  -  e  ’  io23  a  1  .  I  .  v  (ii) 

n  M  n^ 

a  o 

it  is  seen  to  be  essentially  inaependent  of  the  concentration  of 
the  vapor  in  air,  since  all  terms  on  the  right  side  are  either 
concentration- independent  ,  or  occasionally  only  partially 
dependent  as  is  the  case  with  the  activity  coefficient.  The 
ratio  n/n.  is  equivalent  to  the  gas  chromatographic  retention 

a 

volume  of  the  specific  comn^n.ent  in  the  collecting  phase  at 
collection  temperature-- usually  room  temperature. 

In  the  later  gas  chromatographic  analysis,  unless  it  is 
conducted  using  the  same  stationary  phase  as  in  the  collector 
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(Apiezon  L) ,  it  is  impossible  to  determine  for  which  components 
the  air  volume  passed  through  the  collector  has  exceeded  the 
n/na  volume.  The  most  convenient  solution  for  this  indeterminacy 
was  found  in  developing  a  fluidized  bed  collector.  Here  the 
collecting  phase,  in  the  form  of  Apiezon  L  films  dispersed  on 
Teflon  particles,  is  placed  in  a  capsule  through  which  air  to  be 
sampled  is  flown  at  a  high  rate.  The  vapors  in  air  reach  each 
particle  easily,  since  the  powder  is  in  a  violent  motion,  and  an 
equilibration  is  reached  on  all  particles  essentially  simultaneous¬ 
ly.  For  fluidized  bed  samplers  coated  with  10-wt  percent  of 
Apiezon  L,  equilibration  was  reached  within  20-30-min.  To  assure 
equilibration,  the  collection  was  continued  for  45-min. 

The  fluidized  bed  material  is  contained  in  a  cylindrical 
glass  column,  5-in  long  and  1-1/4-in  wide  (Figure  2).  The 
column  is  provided  with  stainless  steel  screen  discs  at  each  of 
its  ends.  The  diameter  of  the  screen  openings  is  selected  to 
retain  the  powder  in  the  column  during  vapor  collection  and  to 
exert  minimum  resistance  to  the  airflow. 

The  fluidized  bed  material  is  introduced  into  the  column 
through  a  side  arm,  which  is  then  sealed  off.  The  bed  material 
remains  in  the  column  during  its  conditioning,  vapor  collection, 
and  vapor  transfer.  The  bed  material  consists  of  10  g  of 
Fluoropak  80  coated  with  10%  Apiezon  L  grease;  therefore  a 
typical  fluidized  bed  sampler  contains  1  g  of  Apiezon  L  in  the 
form  of  a  film  a  few  microns  thick.  From  many  conventional  gas 
chromatographic  phases  tried,  only  Apiezon  L  permitted  later 
recovery  of  dissolved  organics  without  artifacts  (extra  peaks) 
from  decomposition  of  the  trapping  phase  itself. 

Before  each  vapor  collection,  the  bed  material  is  conditioned 
by  heating  the  bed  at  80°C  and  flushing  it  with  pure  helium  gas 
until  all  volatile  impurities  are  removed.  The  conditioning 
efficiency  is  established  by  collecting  residual  impurities  in  an 
injector  needle  (cf  Section  3.3)  and  gas  chromatographing  the 
collected  sample.  The  bed  material  is  considered  clean  when  t he 
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residual  impurities  produce  peaks  of  negligible  areas.  Temperatures 
higher  than  80°C  maintained  for  1  hr  led  to  incipient  thermal 
decomposition  of  Apiezon  L  with  formation  of  volatile  compounds. 

The  bed  material  is  then  exposed  to  air  carrying  the  vapors 
to  be  collected  and  analyzed  Air  is  either  pushed  or  pulled  by 
a  small  blower  through  the  sampler.  In  typical  vapor  collections, 
the  air  flows  at  the  rate  of  0.5  to  0.75  liters/sec,  At  rates  of 
this  order  of  magnitude,  the  powder  is  constantly  floating  and  is 
m  excellent  and  uniform  contact  with  flowing  air. 

During  vapor  collection  the  bed  is  kept  at  ambient  temperatures, 
which  are  measured  because  the  number  of  molecules  collected  through 
equilibration  is  a  function  of  temperature.  Experiments  with 
n-butanol  vapor  in  air  indicated  that  at  ambient  temperatures  the 
n-butanol  content  of  the  bed  reaches  two-thirds  of  the  equilibrium 
value  within  20  min  and  for  all  practical  purposes  reaches 
equilibrium  value  within  45  min.  The  rate-limiting  factor  is  the 
diffusion  of  the  organic  material,  e.g.,  n-butanol,  into  the 
greaselike  film.  Faster  equilibration  would  require  lower-viscosity 
films,  which,  however,  would  be  more  volatile  and  could  introduce 
artifacts  during  analysis.  Faster  equilibration  would  also  be 
achieved  at  a  higher  bed  temperature,  but  a  smaller  finally 
collected  amount  and  a  loss  of  simplicity  in  collection  would 
result , 

Since  collection  involves  equilibration,  the  actual  airflow 
rate  through  the  sampler  does  not  have  to  be  known  and  is  not 
critical  as  long  as  the  powder  floats  and  does  not  pack  at  the  end 
screens.  Therefore  it  is  unnecessary  to  measure  airflow  rates 
during  sampling.  However,  water  condensation  in  the  sampler  must 
absolutely  be  avoided,  since  condensed  water  constitutes  a 
competitive  collection  phase  having  an  unknown,  variable  volume. 

Also,  if  the  concentration  of  the  organic  components  in  air  changes 
rapidly  during  the  sampling,  the  collected  amount  will  not  follow 
the  trends  well.  Therefore  the  sampler  is  best  suited  for  sampling 
from  air  in  which  the  concentrations  of  the  organics  are  reasonably 
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stable.  The  time  constant  of  the  sampler  is  of  order  of  20  min. 
Concentration  fluctuations  that  are  much  slower  or  much  faster 
than  this  value  will  be  unimportant- 

3 ...  3 _ Vapor  _  Transf  e  r  _  a  nd  _  I  n  j  e  ct  i  on_  Appa  r  a  t  us 

Vapor  components  trapped  in  the  thin  film  m  the  fluidized 
bed  are  transferred,  by  elution  with  cryogenically  purified  helium 
gas  (Figure  3) ,  into  a  stainless  steel  thin-wall  injector  needle 
(Figure  4) .  During  transfer,  the  bed  material  is  heated  to  80°C 
and  the  injector  needle  is  cooled  to  liquid  nitrogen  temperature 
in  a  specially  designed  copper  block  immersed  at  its  bottom  end 
m  liquid  nitrogen.  The  typical  duration  of  vapor  transfer  is 
1  hr,  and  70  to  80%  of  the  organic  vapor  sample  is  transferred 
into  the  injector  needle.  Higher  efficiency  can  be  achieved  by 
longer  transfer  at  the  expense  of  analytical  time.  Higher 
temperatures  result  in  incipient  breakdown  of  Apiezon  L  and  peak 
artifacts. 

The  vapor  sample  in  the  injector  needle  is  delivered  to  the 
chromatographic  column  by  means  of  an  injection  system  (Figure  5) . 
The  essential  parts  of  the  injection  system  include  an  electrical 
motor  (A)  with  an  eccentric  cam.  a  stainless  steel  bellows  (B) , 
a  stainless  steel  cylindrical  bellows  enclosure  (C) ,  a  pressure 
gauge  (D) ,  a  stainless  steel  capillary  U-shaped  cold  trap  (E) ,  and 
a  stainless  steel  injector  needle  (F) . 

The  motor,  acting  through  the  eccentric  and  a  spring-loaded 
pushing  rod,  can  draw  a  1-cm^  volume  of  carrier  gas  from  the 
injection  port  and  into  the  bellows  and  later  employ  this  volume 
to  inject  the  sample  from  the  heated  injector  needle  into  the 
septum  of  the  column.  The  bellows  enclosure  provides  a  smooth  fit 
of  the  bellows,  with  a  minimum  of  dead  space.  The  pressure  gauge 
indicates  the  injection  port  pressure  at  any  column  temperature 
and  any  temporary  pressure  change  due  to  injection.  The  gauge 
also  serves  to  indicate,  by  a  pressure  lower  than  usual,  any  leaks 
in  the  septum/needle  system.  The  U-shaped  trap,  which  is  1/16-in 
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in  outside  diameter  and  cooled  to  liquid  nitrogen  temperature, 
blocks  the  passage  of  impurities  from  the  bellows  and  the  pressure 
gauge  into  the  injector  needle.  When  not  in  use,  the  injection 
system  is  kept  under  the  flow  of  pure  helium  gas.  During  this 
standby  interval,  helium  flows  into  the  system  at  a  measured  rate 
(flowmeter  G)  and  pressure  (pressure  gauge  H)  from  a  separate 
source  through  a  liquid  nitrogen  trap  (I)  and  a  T-joint  (K) . 

Sample  injection  involves  the  following  steps.  The  needle 
containing  the  sample  and  kept  in  dry  ice  is  attached  to  the 
injection  system  at  the  outlet  of  the  cold  trap  by  means  of  a 
Swagelok  connector  (L) .  The  helium  flow  is  then  discontinued  at 
K  by  closing  the  connection  with  a  Swagelok  nut  provided  with  a 
metal  rod.  The  other  end  of  the  injector  needle  is  attached  to 
the  injection  port  of  the  chromatograph  (M)  by  means  of  a  Conax 
connector  equipped  with  a  Teflon  sealing  plug,  and  time  is 
allowed  for  attainment  of  the  injection-port  pressure,  as  indicated 
by  the  pressure  gauge  (D) . 

A  1-cm3  volume  of  carrier  gas  is  drawn  into  the  injection 
system  by  means  of  the  motor-driven  bellows.  Again,  time  is 
allowed  for  stabilization  of  the  pressure.  To  inject  the  vapor 
sample,  the  needle  is  heated  to  the  selected  temperature  by  passing 
a  20-amp  current  along  the  needle  while  the  vapors  are  flushed  into 
the  septum  with  the  1-cm^  volume  of  carrier  gas  stored  previously 
in  the  bellows.  The  needle  temperature  rises  from  -80  to  180°C 
in  5  sec;  it  then  remains  at  180 °C  for  the  next  5  sec.  Injection 
of  helium  gas  from  the  bellows  starts  simultaneously  with  the 
heating  of  the  needle  and  continues  for  10  sec.  During  this 
interval,  the  helium  from  the  bellows  carries  the  vaporized  sample 
into  the  septum  while  the  normal  carrier  gas  flow  at  60  cm  /min 
continues. 

The  injection  system  was  tested  on  a  large  number  of  various 
samples,  and  it  was  found  to  produce  reproducible  and  practically 
quantitative  injections,  with  95  to  98%  of  the  sample  introduced 
into  the  septum  in  one  injection. 
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3.4  Calibrat ic 


Calibration  of  the  gas  chromatographic  readout  for  known 
compounds  is  relatively  simple.  To  calibrate  for  any  component 
of  the  vapor,  the  component  must  first  be  identified  and  obtained 
in  a  pure  state  from  a  commercial  or  a  synthetic  supply,*  An 
airflow  system  (Figure  6)  is  set  up  in  which  breathing-quality 
air  from  a  commercial  pressurized  tank  is  passed  through  an 
aerosol  filter  (A)  of  a  material  such  as  Teflon,  a  flowmeter  (B) , 
and  a  freshly  activated  carbon  bed  (c)  cooled  in  a  dry  ice-propanol 
mixture  (D)  at  its  exit  end.  These  precautions  remove  organic 
impurities  from  the  air.  The  purified  air  is  warmed  to  room 
temperature-  It  flows  through  a  2-liter  three-neck  flask  ( E) , 
mounted  with  the  necks  facing  downward,  into  a  clean  fluidized 
bed  trap  (F) .  One  neck  of  the  flask  is  connected  to  an  upright 
small  cylinder  (G)  with  a  small  amount  of  pure  component  on  its 
closed  bottom.  The  cylinder  is  thermostated  either  at  or  below 
the  temperature  of  the  flask. 

The  amount  of  component  that  diffuses  into  the  airstream  in 
the  flask  depends  on  the  vapor  pressure  of  the  component  and  the 
diffusion  coefficient  of  the  component  in  air.  The  diffusion  rate 
can  be  approximately  calculated  from  these  values  and  from  the 
cross  section  and  the  length  of  the  gaseous  diffusion  path 
formed  by  the  cylinder  volume  above  the  component  sample.  The 
amount  is  also  directly  measurable  by  experimentally  determining 
the  weight  loss  of  component  over  a  longer  period  of  time. 

With  a  controlled  and  known  airflow  rate  through  the  flask 
and  the  sampler,  the  molecular  concentration  of  the  component  in 
the  air  flowing  through  the  sampler  can  be  calculated.  For 
instance,  during  calibration  for  n-butanol  the  source  intensity 

*For  unknown  components,  their  approximate  concentration  in  air  can 
be  calculated  if  analysis  is  conducted  using  Apiezon  L  column,  and 
the  flame  ionization  detector  's  sensitivity  is  known;  the  gas 
chromatographic  position  gives  the  retention  volume  which  can  be 
recalculated  to  ambient  tmperature  using  Kovats  Index  system. 
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was  6  x  1015  molecules/sec  and  the  airflow  was  750  cm^/sec.  The 

concentration  of  n-butanol  in  the  calibration  stream  was  therefore 

8  x  10  molecules/cm  .  In  a  typical  calibration  this  concentration 

of  n-butanol  in  air  resulted  in  an  n-butanol  peak  with  an  area 
2 

of  220  cm  at  a  sensitivity  of  0.1  x  1  (the  highest  sensitivity 

setting  on  the  chromatograph) .  This  calibration  procedure  ties 

the  concentration  of  a  component  in  air  directly  to  the  size  of 

the  peak  on  the  chromatogram  if  a  standardized  sampler  and 

standardized  sample-handling  procedures  are  used.  In  the  above 
2 

case,  1  cm  of  peak  area  at  0.1  x  1  sensitivity  corresponds  to 
3.6  x  1010  molecules  per  cubic  centimeter  of  air,  and,  since 
smaller  peaks  are  detectable,  the  detection  limit  is  somewhat 
lower  than  parts  per  billion  by  volume. 

The  factor  relating  the  peak  area  to  the  concentration  in 
air  depends  on  the  partition  coefficient,  k,  for  the  component  i 
in  Apiezon  L  versus  air?  k.  =  (n  )  /(n  )  with  n  for  concentration 

1  C  2.  o  X  3 

in  air.  It  is  possible  to  estimate  the  value  of  this  coefficient 

from  the  amount  of  Apiezon  L  in  the  sampler,  the  estimated  sample 

transfer  efficiency  during  transfer  from  the  sampler  to  the  gas 

chromatograph  in  the  standardized  1-hr  elution  procedure,  and 

the  knowledge  (obtained  by  direct  calibration)  of  the  absolute 

2 

number  of  molecules  corresponding  to  1  cm  of  the  peak  area.  For 
n-butanol  in  Apiezon  L  at  room  temperature,  the  partition  coefficient 
was  4000.  Therefore  the  described  procedure  corresponds  to  a 
concentration  of  n-butanol  that  is  4000-fold  greater  than  that  in 
air,  and  the  analysis  is  equivalent  to  a  one-shot  injection  of 
n-butanol  contained  in  4  liters  of  air. 

If  the  molecular  weight  of  Apiezon  L  is  approximately  1000, 
the  activity  coefficient  for  n-butanol  in  this  medium  is,  very 
approximately,  3.  For  larger  polar  molecules  and  for  smaller 
but  less-polar  molecules,  lower  activity  coefficients  and 
correspondingly  higher  partition  coefficients  are  expected. 

Table  I  shows  calibration  data  for  some  polar  and  relatively 
small  molecules  for  which  Apiezon  L  is  a  poor  concentrating  medium, 
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Nevertheless,  satisfactory  collection  is  obtained  even  for  these 
vapors.  For  the  less  volatile  substances,  a  higher  relative 
sensitivity  is  expected  since  the  partition  coefficient 
increases  with  decrease  of  the  vapor  pressure.  Similarly,  higher 
enrichment  will  occur  with  other,  less  polar  vapors.  When  a 
vapor  source  is  enclosed  in  a  stream  of  air  flowing  at  a  known 
rate  through  the  enclosure  and  then  through  the  sampler,  the 
total  source  strength  can  be  calculated.  For  example,  the  rate 
of  buta  1  production  from  a  human  subject  was  found  in  one  case 
to  reach  10^’’  molecules/sec. 

3.5  Field  Sampling  Apparatus 

For  the  collection  of  vapors  emitted  by  humans  enclosed  in 
the  glass  cell  (cf.  Section  3.1)  the  fluidized  bed  sampler  was 
placed  at  the  air  exit  from  the  cell.  It  was  possible  to  place 
two  or  three  collectors  in  series  and  collect  several  comparable 
samples  simultaneously. 

For  sampling  in  open  environments,  an  apparatus  shown  in 
Figure  7  was  used.  It  consisted  of  a  12-volt  automobile  type 
vacuum  cleaner  driven  by  a  portable  (airplane  type)  12-volt 
battery.  The  cleaner  pulled  the  air  through  the  fluidized  bed 
sampler.  Provisions  were  available  for  water  circulation  through 
a  glass  jacket  integral  with  the  sampler,  to  maintain  desired 
(room  temperature)  temperature  level  if  the  outside  air  was  too 
cold  or  too  warm.  Since  fluidized  beds  act  as  very  efficient 
heat  transfer  media,  additional  pre-thermostatting  of  air 
before  entrance  into  the  sampler  would  be  necessary  if  ambient 
temperature  extremes  were  encountered.  In  the  present  work, 
environment  temperatures  were  selected  so  as  to  avoid  such 
extremes  and  simplify  the  collection  and  calculations. 
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3.6  Gas  Chromatographic  Techniques 


3.6.1  Packed  Columns 

The  primary  objective  of  the  initial  phase  of  the 
experimental  work  was  the  acquisition  of  a  set  of  gas  chromato¬ 
graphic  patterns  of  human  chemical  signatures  from  subjects  of 
various  races  and  sexes  as  well  as  from  different  environments 
under  simple  and  reproducible  analytical  conditions.  For  this 
purpose,  a  packed  analytical  column  was  selected  and  the  analyses 
were  performed  under  isothermal  conditions  at  a  moderate 
temperature.  The  selected  column  was  10-ft  long  and  1/8-in  O.D, 
coiled  stainless  steel  tubing  packed  with  20%  Carbowax  20M 
supported  on  acid-washed  60/80  mesh  Chromosorb  P.  During  each 
signature  analysis,  the  column  was  kept  at  125 °C  in  the  air  oven 
of  the  dual-channel  Aerograph  model  204  gas  chromatograph.  The 
selection  of  a  packed  column  was  based  mainly  on  its  relatively 
high  stability  or  low  rate  of  secondary  aging  -  a  favorable 
condition  for  minimizing  the  effect  of  column  age  on  its  analytical 
performance.  The  isothermal  analytical  conditions  were  chosen  with 
some  sacrifice  on  component  resolution,  to  maintain  maximum 
retention  time  reproducibility  for  each  component  of  a  complex 
mixture  analyzed. 

In  the  later  stage  of  this  investigation,  two  packed  Apiezc-n  L 
columns  were  also  used  in  the  three-column  apparatus  described 
below.  In  this  apparatus,  the  packed  columns  were  attached  to 
one  branch  of  the  splitter  connected  to  the  outlet  of  a  200-ft 
open  tubular  Carbowax  20M  column.  The  packed  columns  were 
selected  to  minimize  the  change  in  Apiezon  L  polarity  due  to  the 
effluent  impurities  from  the  Carbowax  column. 

3.6.2  High-Resolution  Column 

In  order  to  gain  a  deeper  insight  into  the  actual 
complexity  of  human  chemical  signatures,  modern  high  separating 
power  and  high  analytical  speed  columns  were  chosen  and  used  in  a 
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single  column,  a  two-column,  and  a  three-column  gas  chromatographic 
systems.  The  selected  columns  included  a  50-ft,  0.020-in  I.D. 
open  tubular  Carbowax  20M  column,  a  200-ft ,  0.020-in  I.D.  open 
tubular  Carbowax  20M  column,  a  50-ft,  0.020-in  I.D.  open  tubular 
Apiezon  L  column  and  another  200-ft,  0.020  I.D.  open  tubular 
Carbowax  20M  column,  all  support  -  coated . 

The  50-ft  Carbowax  20M  column  was  used  in  combination  with  a 
mass  spectrograph  to  help  resolve  human  signature  components  in 
the  attempts  directed  toward  their  identification. 

The  200-ft  Carbowax  20M  column  was  used  in  combination  with 
the  50-ft  Apiezon  L  column  in  the  two-column  apparatus.  The 
apparatus  was  designed  to  provide  a  primary  resolution  of  sample 
components  in  the  polar  phase  and  a  secondary  resolution  of  the 
individual  Carbowax  20M  peaks  in  the  non-polar  phase.  The  two- 
column  apparatus  was  used  in  charact er ization  of  chemical  signature 
components  obtained  from  the  same  Caucasian  female.  The  apparatus 
is  described  in  the  following  section  of  this  report. 

The  second  200-ft  Carbowax  20M  column  was  incorporated  into 
the  three-column  apparatus.  In  this  arrangement,  the  split  outlet 
of  the  long  open  tubular  column  was  connected  to  two  parallel 
short  packed  Apiezon  L  columns.  The  system  was  designed  to 
produce  simultaneously  entire  signature  patterns  in  polar  and 
non-polar  phases  and  correlate  the  patterns  by  means  of  a  specifi¬ 
cally  written  computerized  peak-matching  procedure.  The  system 
was  employed  in  the  investigation  of  several  signatures  collected 
from  a  Caucasian  female  who  also  was  the  signature  donor  for  the 
study  performed  on  the  two-column  apparatus.  The  description  of 
the  system  is  given  below. 

3.6.3  Two-Column  Apparatus 

The  modern  high-resolution  gas  chromatographic  columns 
provide  a  spectacular  resolution  of  complex  mixtures?  however, 
many  components  still  remain  unresolved  and  they  may  be  represented 
by  single  peaks.  The  two-column  apparatus  was  designed  and 
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constructed  to  effect  resolution  of  such  multi-component  peaks 
and  to  provide  data  useful  in  characterization  of  the  structure 
of  the  resolved  components. 

The  two-column  apparatus  includes  a  200-ft  open  tubular 
Carbowax  20M  column  (0.020-in  I.D.),  a  50-ft  open  tubular 
Apiezon  L  column  (0.020-in  I.D.) ,  a  50-ft  dummy  column,  a  3-way 
solenoid  valve,  two  hydrogen  flame  ionization  detectors  and  two 
sniffing  ports.  A  schematic  diagram  of  the  arrangement  of  the 
apparatus  is  shown  in  Figure  8. 

In  dual  column  analyses,  the  Carbowax  20M  column  provides  the 
primary  sample  resolution.  The  effluent  from  this  column  is 
split  1:1,  and  subsequently  all  or  any  part  of  the  split  effluent 
may  be  directed  onto  the  Apiezon  L  column  for  secondary  resolution. 
The  peak  switching  step  is  effected  by  means  of  the  solenoid  valve. 
During  this  step,  a  switching  blip  appears  on  the  chromatogram 
which  denotes  exactly  the  portion  of  the  Carbowax  20M  chromatogram 
transferred  onto  the  Apiezon  L  column.  Both  analytical  columns 
and  the  dummy  column  are  provided  with  10  cc/min  helium  flow.  The 
5  cc/min  portion  of  the  Carbowax  20M  effluent  is  supplemented  with 
35  cc/min  of  helium  and  resplit,  providing  20  cc/min  flow  to  a 
flame  detector  and  a  sniffing  port.  The  other  5  cc/min  flow  from 
the  Carbowax  column  is  mixed  with  5  cc/min  of  helium  and  the  total 
10  cc/min  is  directed  to  either  the  Apiezon  L  column  or  the  dummy 
column.  At  the  exit  of  the  Apiezon  L  column  30  cc/min  of  helium 
is  added  to  the  10  cc/min  flow  and  the  total  is  again  split  so 
that  20  cc/min  flows  to  the  second  detector  and  20  cc/min  to  the 
second  sniffing  port.  Approximately  equal  concentrations  of  sample 
reach  the  two  sniffing  ports;  this  permits  a  comparison  of  relative 
odor  intensities  when  needed. 

A  segment  of  a  typical  gas  chromatogram  obtained  in  the  dual¬ 
column  analysis  of  a  human  chemical  signature  is  shown  in  Figure  9. 
The  upper  portion  of  the  figure  represents  peaks  obtained  on 
Carbowax  20M  column;  the  peaks  marked  with  asterisks  were  injected 
on  Apiezon  L  column.  The  lower  portion  of  the  figure  shows  the 

lit  RESEARCH  INSTITUTE 


17 


components  of  the  injected  peaks  which  were  resolved  on  Apiezon  L 
column . 

The  proper  utilization  of  the  two-column  apparatus  requires 
a  knowledge  of  the  retention  dispersion  characteristics  for 
non-polar  compounds  as  well  as  for  selected  polar  compounds  in 
Apiezon  L  stationary  phase.  The  n-alkanes  are  expected  to  exhibit 
maximum  retention  on  Apiezon  L  and  the  n-alcohols  the  minimum 
retardation.  Other  functional  groups  such  as  aldehydes  and 
ketones  are  expected  to  fall  between  the  retention  times  shown 
by  the  alkanes  and  the  alcohols.  The  information  on  the  retention 
behavior  of  selected  compounds  was  acquired  in  a  series  of 
calibration  experiments.  In  these  experiments,  the  n-alkane 
series  included  CQ ,  C^q,  C,.,  ci3'  C14'  and  C15  hydrocarbons , 

the  alcohols  were  represented  by  ethanol,  1-propanol,  1-butanol 
and  1-pentanol;  the  aldehydes  included  propionaldehyde , 
butyraldehyde ,  hexanal,  heptanal,  and  octanal;  the  ketone  series 
embraced  2-butanone,  3-pentanone,  2-heptanone,  and  2-octanone. 

The  calibration  experiments  were  performed  under  the  same 
operating  conditions  and  temperature  (125 °C)  as  used  in  the 
analysis  of  human  samples.  The  Carbowax  column  effluent  was 
allowed  to  flow  continuously  onto  the  Apiezon  L  column.  For 
n-alkanes  studied  the  experimental  data  were  first  expressed  in 
terms  of  the  respective  elution  times  from  both  liquid  phases; 
these  data  guided  the  experiments  on  human  samples  by  providing 
the  maximum  time  intervals  required  for  a  complete  elution  and 
display  of  all  components  of  a  given  Carbowax  20M  peak  which  was 
injected  onto  the  Apiezon  L  column.  The  n-alkane  data  were  then 
converted  to  corrected  retention  times  and  these  were  used  in  the 
construction  of  calibration  curves  needed  in  the  characterization 
of  components  of  human  chemical  signatures  in  terms  of  their 
respective  Kovats  Indexes  on  both  stationary  phases.  The 
calibration  data  on  n-alkanes  are  given  in  Table  II. 

The  calibration  data  on  polar  compounds  were  first  converted 
to  respective  corrected  retention  times.  These  retention  times 
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were  utilized  in  determining  Kovats  Retention  Indexes  for  the 
selected  compounds  from  previously  obtained  n-alkane  calibration 
curves-  The  retention  indexes  were  determined  in  Carbowax  20M 
and  apiezon  L  stationary  phases.  The  respective  Kovats 
Indexes  were  then  employed  in  the  construction  of  calibration 
curves  for  polar  compounds  which  were  displayed  in  the  chemical 
logic  field.  These  curves  together  with  those  constructed  for 
other  compounds  from  pertinent  literature  data  served  in  the 
description  of  the  structure  of  components  of  the  human  chemical 
signatures  studied.  The  calibration  data  on  polar  compounds 
are  summarized  in  Table  III. 

In  order  to  minimize  the  inherent  differences  in  composition 
of  chemical  signatures  of  different  individuals,  the  collection 
experiments  in  this  part  of  the  work  were  limited  to  one  subject  and 
performed  in  a  period  of  a  few  days.  The  selected  subject  was 
a  young  Caucasian  female  of  high  hygienic  standards.  During  the 
experimental  period,  the  subject  was  on  an  average  diet.  A 
total  of  four  collections  was  made.  In  one  collection,  three 
fluidized  Apiezon  L  beds  were  employed  in  series  and  two  beds 
in  series  were  used  in  each  of  the  other  collections.  In  the 
three-bed  experiment,  ten  Carbowax  20M  peaks  were  injected  onto 
the  Apiezon  L  column;  in  the  remaining  collections,  eight,  four, 
and  three  multi- component  Carbowax  20M  peaks  were  examined. 

The  examination  was  performed  with  the  two-column  system  kept 
at  125 °C.  The  peaks  observed  on  Apiezon  L  column  and  originating 
from  the  respective  Carbowax  peaks  were  described  in  terms  of 
Kovats  Retention  Indexes  and  the  data  were  plotted  in  two- 
dimensional  (polar  versus  non-polar)  chemical  logic  field  to 
produce  a  tentative  description  of  the  structure  of  the  components 
studied . 


3.6.4  Three-Column  Apparatus 

The  three-column  gas  chromatographic  system  was  constructed 
to  provide  a  detailed  two-coordinate  description  (polar  and  non- 
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polar)  of  sample  components  from  one  single  analytical  run  without 
the  need  for  peak  trapping  and  reinjection.  The  system  is 
schemat ical] v  depicted  in  Figure  10.  The  essential  components  of 
the  system  include  a  200-ft  open  tubular  Carbowax  20M  column,  a 
6- ft  x  1/8- m  packed  Apiezon  L  column,  a  4-ft  x  1/8-in  packed 
Apiezon  L  column,  and  three  flame  ionization  detectors.  The 
outlet  of  the  200  ft  column  splits  into  two  branches;  one  branch 
leads  to  the  flame  ionization  detector,  the  other  branch  to  two 
parallel  Apiezon  L  columns.  The  outlet  of  each  Apiezon  L  column 
connects  to  a  flame  ionization  detector.  Helium  gas  is  delivered 
to  the  system  from  two  independent  flow  trains.  One  train  supplies 
the  Carbowax  20M  column  with  the  flow  rate  of  5cc/min,  the  other 
train  delivers  helium  to  the  Apiezon  L  columns?  the  flow  rates  are 
40cc/min  and  50cc/mm  in  the  long  and  the  short  columns 
respectively.  The  flame  ionization  detector  residing  at  the 
outlet  of  the  Carbowax  column  is  supplemented  with  auxiliary 
helium. 

Tne  long,  oper -tubular  Carbowax  20M  column  is  used  to  provide 
the  initial  optimum  sample  separation.  The  separated  sample  com¬ 
ponents  split  into  three  fractions;  two  fractions  simultaneously 
enter  both  Apiezon  L  columns;  while,  also  at  the  same  instant,  the 
third  fraction  is  detected  by  the  detector  attached  to  the  Carbowax 
column.  The  Apiezon  L  columns  are  used  to  obtain  the  retention 
behavior  for  each  sample  component  on  a  non-polar  phase.  The 
parallel  arrangement  of  these  columns  and  the  combination  of 
different  lengths  and  carrier  gas  flow  rates  provide  characteristic 
ratios  of  retention  times  which  are  useful  in  the  subsequent  peak 
matching  procedure. 

Each  component  of  the  sample  is  represented  by  three  gas 
chromatographic  peaks  which  emerge  with  characteristic  sequence 
and  elution  times.  The  f;rst  peak  appears  on  the  detector  receiving 
the  signals  from  the  Carbowax  column,  the  second  is  eluted  from  the 
short  Apiezon  L  column  and  the  third  from  the  long  Apiezon  L  column. 
An  example  of  a  typical  gas  chromatogram  (only  a  segment  is  given) 
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here)  of  a  human  chemical  signature  analyzed  on  three- column 
apparatus  is  given  in  Figure  11.  In  order  to  extract  useful 
information,  the  system  is  calibrated  with  various  homologous 
series  of  compounds 

In  a  typical  calibration  run  a  .mixture  of  homologs  and 
selected  alkanes  is  injected  into  the  system  and  analyzed  at 
constant  temperature  For  each  component  of  the  mixture  the 
characteristic  elution  times  from  all  three  columns  are  m  ■ ea 
and  converted  to  corrected  retention  data  The  conversic 
requires  the  knowledge  of  retention  characteristics  for  mt 
This  is  obtained  under  similar  analytical  conditions  by  incorpor¬ 
ating  the  component  into  the  sample  or  analyzing  methane  before 
injecting  the  sample  into  the  mixture 

The  corrected  retention  data  are  used  in  the  establishing  of 
the  peak  shifts  which  occur  on  tne  Apiezon  L  columns  These 
shifts  are  given  as  the  ratios  of  the  corrected  retention  time 
for  each  component  on  the  long  Apiezon  L  column  and  the  short 
Apiezon  L  colu.mm  The  value  of  these  ratios  which  was  found  to 
be  constant  for  all  compounds  studied  is  used  in  the  peak  matching 
procedure  in  the  interpretation  of  the  unknown  samples-  The 
corrected  retention  data  are  also  utilized  in  the  computation  of 
the  Kovats  Retention  Indexes  for  the  components  of  homologous 
series  in  the  two  stationary  phases  employed-  The  retention 
indexes  serve  to  construct  calibration  curves  which  are  displayed 
in  a  chemical  logic  field  for  final  characterization  of  components 
of  actual  samples 

The  results  of  experiments  on  calibration  of  the  three  column 
apparatus  are  summarized  in  Tables  IV  to  X-  The  data  on 
calibration  performed  using  n  alkanes  with  the  three  column 
system  kept  at  90 °C  120 °C  and  150 °C  are  given  in  Tables  IV  to 
VII  . 

It  is  noted  that  the  ratios  of  the  corrected  retention  times 
observed  on  Apiezon  L  column  for  all  compounds  studied  are 
practically  constant  and  temperature  independent  The  average 
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value  of  these  ratios  was  used  in  the  computerized  indexing  of  the 
human  chemical  signature  components. 

The  results  of  calibration  experiments  carried  out  with  polar 
compounds  are  summarized  in  Tables  VIII  to  X.  The  ratios  of  the 
corrected  retention  times  on  Apiezon  L  columns  for  all  compounds 
studied  are  also  constant  and  temperature  independent.  The  elution 
data  listed  in  these  tables  were  used  in  the  three-point  match 
procedure  designed  to  inspect  the  three  gas  chromatographic  records 
for  the  presence  of  such  compounds  in  human  chemical  signatures. 

3.6.5  Calibration  for  Conversion  to  Kovats  Indexes 
The  initial  series  of  gas  chromatographic  patterns  of 
human  chemical  signatures  was  acquired  without  calibrating  the 
analytical  system  with  a  synthetic  mixture  of  n-alkanes.  In  order 
to  evaluate  the  daily  instrumental  variations  affecting  the 
positions  of  the  signature  components  in  their  respective  patterns 
and  to  assign  Kovats  Indexes  to  those  components,  a  set  of  two  peaks 
always  observed  in  each  pattern  was  selected  and  used  as  pattern 
"internal  standard".  One  of  the  selected  peaks  emerged  in  the 
initial  section  of  the  patterns  with  an  average  retention  time  of 
2.80  +  0  09  min.  (corresponding  to  n-butanol) ,  the  other  peak  was 
observed  in  the  final  section  of  the  patterns  with  an  average 
retention  time  of  27.84  +  0.47  mm.  The  results  of  the  retention 
time  standardization  for  the  initial  series  of  the  chemical  signature 
patterns  were  then  correlated  with  those  obtained  in  the  second 
series  of  patterns  which  were  standardized  with  a  synthetic  mixture 
containing  hexane,  octane,  decane,  dodecane  .  tetradecane ,  hexadecane, 
and  1-butanol.  The  correlation  facilitated  the  assignment  of  the 
statistical  pattern  analyses  of  both  series  of  patterns  and 
provided  data  needed  in  the  computation  of  Kovats  Indexes  for  all 
pattern  components. 

The  Kovats  retention  indexes  were  computed  for  each  component  of 
the  signatures  by  using  the  following  equation: 

1(A)  =  100  •  (y-x)  l0g  {}^a/t_rx)  +  (i00  *  x) 

log  (try./tj.x) 
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where 


1(A)  =  retention  index  of  compound  A 

tra  =  retention  time  of  compound  A 

trx  r  retention  time  of  n  alkane  CxH2x  +  2 

t r y  -  retention  time  of  n  aLkane  CyH2y  +  2 

For  most  of  the  components  the  calculations  were  made  by 
interpolating  for 


Only  a  few  values  were  obtained  by  extrapolation.  To  satisfactorily 
determine  the  absolute  retention  values  needed  in  the  computation 
of  the  Kovats  Indexes  it  is  necessary  to  know  the  gas  hold-up 
time  of  the  column  under  standardized  analytical  conditions-  Since 
it  is  impossible  to  measure  this  value  if  the  detector  is 
insensible  to  inert  gases  the  retention  time  for  methane  is 
usually  taken  as  equal  to  the  gas  hold-up  time-  The  value  was 
measured  (0  31  min)  and  used  to  correct  the  retention  times  of  the 
components  of  the  h-iman  signature  and  synthetic  mixture 

The  patterns  of  human  signatures  were  then  compared  with  those 
of  various  environments  and  Kovats  Index  ranges  for  Carbowax  20M 
peaks  were  selected  which  were  most  significant  for  discrimination 
of  humans  and  environments 

In  order  to  obtain  a  more  precise  characterization  of  the  most 
significant  components  of  signatures-  a  systematization  process  was 
introduced  to  treat  the  data  obtained  on  the  high- resolution  two- 
column  apparatus.  The  principle  underlying  the  process  is 
illustrated  m  Figure  12. 

The  X-axis  is  the  Kovats  Index  scale  for  retention  in  the 
Carbowax  20M  column  The  Y-axis  is  the  delay,  in  minutes,  in  the 
Apiezon  L  (second  stage)  column  after  correction  for  hold-up  time 
which  was  obtained  by  calibration  with  methane  The  Kovats  Index 
scale  that  corresponds  to  these  delays  in  shown  on  the  right.  The 
upper  diagonal  line  represents  the  delay  for  the  n-alkanes  and 
other  non-poiar  saturated  hydrocarbons  This  is  then  the  maximum 
delay  which  a  component  with  a  given  Kovats  Index  in  Carbowax  20M  can 
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experience  in  the  non-polar  Apiezon  L  column.  If  the  compound 
is  somewhat  polar,  or  possesses  double  bonds,  it  will  be  delayed 
less  and  its  point  in  the  two-index  field  of  Figure  12  will  be 
lower.  The  zones  where  various  types  of  compounds  reside,  on 
the  basis  c*  literature  data,  are  indicated.  Straight  chain 
alcohols  are  the  most  polar  of  the  compounds  considered  here 
(the  glycol  zone  is  also  shown)  and  are  located  on  a  straight 
line.  This  is  drawn  using  our  own  experimental  points.  Points 
tor  some  other  compounds  not  reported  in  literature  are  also 
shown. 

Branched  alsohols,  or  alcohols  with  OH  groups  not  on  the 
terminal  carbon  atom  lie  above  the  n-alcohol  line,  but  short  of 
the  carbonyl  compound  zone.  Double  bonds  e.g..  in  the  alcohols, 
bring  the  points  to  a  zone  somewhat  below  the  non-double  bond 
compounds  of  the  same  type.  Notice  that  the  point  for  n-butanol 
lies  in  the  alcohol  zone  of  the  continuation  of  the  line  formed 
by  higher  n-alcohols  and  at  the  same  horizontal  and  vertical 
distance  from  hexanol-  In  this  particular  case,  only  a  rather 
unusual  compound  (e.g. ,  some  carbonyl  compound  with  more  than  one 
double  bond)  has  an  outside  chance  to  produce  a  point  that  might 
be  confused  with  n-butanol. 

A  set  of  calibration  compounds  was  selected  to  permit 
frequent  calibration  of  a  plot  of  Figure  12  for  several  homologous 
series.  This  is  desired  to  increase  precision  and  reliability, 
since  although  the  plot  as  it  is  represented  appears  to  be 
reliable  within  +  10  K.I.  units,  the  actual  columns  used  can 
differ  slightly  from  the  performance  of  columns  on  the  be 'is  of 
which  Figure  12  was  drawn.  The  causes  for  difference  are  slight 
differences  in  the  average  molecular  weight  of  Apiezon  L,  slight 
oxidation,  contribution  of  adsorption  at  column  walls,  etc. 

3.7  Mass  Spectroqraphic  Identification  Attempts 

Various  attempts  were  made  to  identify  components  of  human 
chemical  signature  by  mass  spectrography .  The  initial  experiments 
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were  performed  on  signatures  collected  from  a  Caucasian  male  and 
from  an  Indian  male.  in  these  experiments,  the  packed  analytical 
Carbowax  20M  column  was  connected  to  the  Hitachi  Perkin-Elmer 
model  RMU-6D  mass  spectrometer  with  a  Watson  Bieman  sample 
enrichment  device.  The  column  effluent  with  human  signature 
components  was  split  so  that  lOcc/min  entered  the  ion  chamber  and 
50cc/min  the  flame  ionization  detector.  The  analysis  was  performed 
at  125  °C . 

Another  series  of  experiments  was  designed  to  evaluate  the 
minimum  amount  of  component  needed  for  its  positive  mass 
spectrographic  identification  to  collect  larger  human  samples  and 
to  improve  its  resolution.  For  this  purpose,  100-g  Apiezon  L 
fluidized  bed  was  constructed  and  tested  in  an  experiment  on 
signature  collection  from  an  Indian  male.  To  improve  the 
component  resolution  and  to  facilitate  the  interpretation  of 
mass  spectrographic  data,  a  50-ft  x  0.020-in  I.D.  support-coated 
open  tubular  column  with  Carbowax  20M  as  the  liquid  phase  was 
employed.  The  column  effluent  was  fed  directly  into  the  ion 
chamber  at  the  rate  of  lOcc/min.  During  the  analysis,  the  column 
temperature  was  programmed  at  2 °/min  from  60-180 °c.  The  peaks 
were  detected  on  picnoammeter  and  timed  from  the  instant  of 
sample  injection  to  the  appearance  of  the  maximum  current  for 
each  peak  investigated.  In  experiments  with  standard-size  beds, 
each  mass  spectrographic  analy»ir,  was  preceeded  by  gas  chromato¬ 
graphic  analysis  of  human  sample  collected  from  the  same  individual. 
This  was  effected  by  collecting  samples  on  several  beds  in  series. 
The  samples  were  collected  from  a  young  Caucasian  female. 

The  minimum  amount  of  compound  needed  for  positive  mass 
spectrographic  identification  was  estimated  with  a  mixture  of 
1-butanol  and  heptanal.  For  this  purpose,  various  aliquots  of 
the  mixture  were  analyzed  under  the  same  conditions  as  used  for 
human  sample.  The  aliquots  were  selected  to  match  major  and 
medium  size  peaks  obtained  from  human  sample.  In  each  case, 
attempts  were  made  to  identify  the  selected  compounds  by  mass 
spectrography . 
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3.8  Statistical  Pattern  Analyses 

As  the  gas  chromatographic  data  on  chemical  signatures  of 
humans  obtained  by  the  f]uidized  bed  collector  technique  began 
to  accumulate,  it  became  evident  that  the  composition  of  the 
organic  vapors  from  humans  varied  significantly  from  individual 
to  individual.  Visual  inspection  of  the  chromatograms  from 
males  and  females  did  not  indicate  obvious  common  patterning 
(except  e.g.,  such  as  a  peak  in  that  location  where  n-butanol 
should  occur)  nor  obvious  group  differences.  At  that  stage  of 
work,  patterns  consisted  of  32  to  48  peaks;  the  sizes  of  the  peaks 
in  the  same  gas  chromatographic  location  varied  within  a  wide 
range  from  individual  to  individual,  and  a  comparison  of  any  two 
gas  chromatograms  indicated  that  always  a  fraction  of  peaks  could 
be  approximately  matched  with  respect  to  their  positions  on  the 
gas  chromatograms,  while  others  would  not  match. 

The  factors  that  could  explain  these  experimental  findings 
were  several.  Some  did  not  become  apparent  until  high-resolution 
two- column  gas  chromatography  was  employed  toward  the  end  of 
the  project  work;  higher  resolution  chromatography 
indicated  that  60  to  100  peaks  could  be  observed,  and  the  two- 
column  apparatus  (Section  3.6.3)  demonstrated  that  most  of  these 
still  could  be  further  resolved  into  2  to  10  peaks,  each  corres¬ 
ponding  to  at  least  one  organic  component.  Hence,  the  32  to  48 
peaks  observed  in  analysis  in  packed  column  (Section  3.6.1)  each 
corresponded  to  mixtures  of  several  components,  close  in  their 
retention  times,  and  the  position  of  the  compound  peak  would  be 
expected  to  shift  as  the  proportion  of  the  components  in  the 
compound  peak  would  change  from  the  individual  to  individual. 

Other  causes  of  the  peak  position  shifts  may  be  the  usual 
experimental  fluctuations  m  the  retention  times,  and  the  presence 
of  some  water  in  the  samples.  The  amount  of  water  in  each  injected 
sample  was  estimated  at  a  few  miligrams.  The  hydrogen  flame 
ionization  detector  will  not  respond  to  water  vapor  to  a  significant 
extent,  but  in  the  Carbowax  20M  analytical  column  the  water,  while 
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passing  through  the  column,  can  temporarily  increase  the  polarity 
of  this  stationary  phase  and  car  for  short  time  intervals, 
influence  the  rate  of  migration  of  the  components  through  the 
column.  The  retention  times  of  the  less  polar  components  will 
slightly  decrease  while  those  of  the  polar  components  will  increase. 

Because  of  the  large  variety  of  the  gas  chromatographic  features 
observed,  it  became  necessary  to  develop  criteria  for  selection  of 
those  features  that  would  be  most  significant  for  identification 
and  classification  of  human  chemical  signatures.  At  this  juncture, 
a  series  of  statistical  pattern  analysis  methods  we  e  applied. 

In  the  present  section,  the  reasons  for  use  of  the  different 
methods  will  be  briefly  outlined;  the  procedures  and  the  results 
are  discussed  in  Section  4.2  and  its  subsections. 

Procedure  A 

Various  similarity  indexes  have  been  proposed  in  literature 
for  comparisons  of  patterns  or  stimuli.  In  general,  the  indexes 
attempt  to  compare  the  extent  of  those  features  which  are  similar 
with  those  which  are  dissimilar,  or  with  the  total  i  nput  of 
features.  The  crucial  part  of  the  procedure  is  the  decision  making 
on  the  correspondence  of  features.  In  the  case  ot  the  gas 
chromatographic  patterns,  the  positions  of  the  peaks  in  two  gas 
chromatograms  may  either  match,  or  mismatch  to  varying  degrees. 

The  decision  must  be  made  as  to  how  much  mismatch  can  be  allowed 
while  still  considering  that  one  deals  with  the  same  feature  in 
both  gas  chromatograms.  A  computer  program  was  developed,  designated 
GASCHR  to  compare  each  peak  in  each  chromatogram  with  peaks  in  the 
other  gas  chromatograms  with  prescribed  values  of  tolerance 
(typically  4  percent  m  retention  time)  in  mismatch.  The  peak 
matching  requires  certain  positive  or  negative  strain  if  the  match 
is  not  exacf ,  and  the  total  algebraic  sum  of  the  strain  over  those 
peaks  which  /ere  matched  was  calculated  and  printed  out.  Usually, 
when  approximately  the  same  numbers  and  amounts  of  the  negative 
were  required,  the  sum  would  not  differ  too  much  from  zero.  Some 
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peaks  were  matchable,  within  the  prescribed  tolerances,  to  two 
peaks  in  the  other  gas  chromatogr am . 

The  peak  matching  generated,  tor  each  pair  of  patterns,  a 
basis  for  calculating  the  various  similarity  indexes: 

Tanimoto  Index*.  This  similarity  index  is  +1  when  the  two 
patterns  match  exactly,  and  zero  when  the  patterns  do  not  match 
at  all  (wherever  there  is  a  peak  m  one  pattern,  there  is  none 
in  the  other) . 

The  Tanimoto  index  is  calculated  as  follows:  After  matching 
the  two  patterns  within  some  prescribed  tolerance  limits  (e.g.  , 
in  one  variant  of  our  program,  peaks  whose  retention  times  in  the 
two  patterns  differed  less  than  by  4  percent  were  considered 
matched  and  forming  ar.  "identical  pair";  if  one  peak  in  one  pattern 
had  several  candidate  matching  peaks,  within  the  chosen  tolerance 
limit,  in  the  other  pattern,  the  peak  which  matched  best  was  chosen 
as  the  partner  of  the  matching  pair) ,  sets  of  peak  pairs  are  taken 
to  describe  the  correspondence  between  the  patterns.  Those  peaks 
of  the  first  pattern  which  had  no  match  in  the  second  pattern,  or 
in  reverse,  formed  pairs  m  which  one  partner  was  zero.  The 
Tanimoto  index  ST  is  the  ratio  of  the  sum  of  the  smaller  par.-1,  ers 
of  the  pairs  divided  by  the  sum  of  the  larger  partners  of  the  pairs. 
A  derivative  measure  of  the  similarity  is  Tanimoto  "distance" 
between  the  patterns 

*  T-s  *\  - 
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It  is  zero  when  the  patterns  are  identical,  and  infinite  when 
the  patterns  completely  mismatch.  One  of  the  advantages  of  this 
index  is  its  ease  of  computation  and  a  lack  of  bias  in  favor  of 
large  differences  in  one  or  a  few  features  such  as  exist  in  the 
similarity  indexes  using  quadratic  terms.  Since  the  large 
differences  in  a  few  features  cannot  a  priori  be  assumed 
significant  in  pattern  classification,  it  is  undesirable  to 
overemphasize  their  influence  in  forming  the  similarity  index. 

Ekman's  Index.  Mathematically,  it  is  a  simple  function  of 
Tanimoto  index,  but  it  is  closer  to  the  psychophysical  concept 
of  similarity  presumably  operative  in  the  comparison  of  two 
stimuli  by  humans  and  developed  independently  of  Tanimoto.  Ekman 
and  Eisler*  consider  that  the  similarity  concept  follows  a 
process  that  is  mathematically  equivalent  to  taking  the  sum  of 
those  parts  of  attributes  common  to  both  stimuli  (here  we 
consider  patterns  as  "stimuli")  and  dividing  this  sum  by  the  sum 
of  both  full  stimuli.  Thus,  Ekman's  similarity  index  for  two 
lengths  A  and  B  is: 

S_  =  A  +  A 

E  aTb 

where 
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This  simple  concept  apparently  suffices  for  reasonably  good 
stimuli  recognition  by  human  .senses. 

Taking  then,  a  bionic  approach,  we  adapted  Ekman's  concept 
for  the  comparison  of  two  patterns  through  the  following  procedure 


*G.  Ekman,  "Some  Aspects  of  Psychophysical  Research  in  Sensory 
Communications,"  Rosenlith,  ed. ,  MIT  Press,  1961,  p.  35. 
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1.  Each  pattern  is  first  normalized  within  itself. 

2.  "Corresponding"  peaks  within  both  patterns  are  indexed 
by  using  the  4%  rule.  This  leaves  surplus  peaks  in  both 
patterns.  The  result  is  a  set  of  pairs  of  attributes. 

For  those  peaks  that  have  no  corresponding  peak  in  the 
other  pattern,  the  corresponding  peak  has  zero  intensity. 

3.  The  normalized  intensities  of  each  pair  of  corresponding 
peaks  are  written  side  by  side. 

4.  The  smallest  sides  of  each  of  the  corresponding  peaks  are 
added.*  the  sum  is  multiplied  by  two. 

5.  All  normalized  intensities  for  both  patterns  are  added. 

6.  The  result  of  step  4  is  divided  by  that  of  step  5,  and 
gives  Sg  (Ekman's  similarity  index).  It  will  be  small 
(approaching  zero)  if  the  similarity  is  small;  it  will  be 
+1  for  exactly  similar  patterns. 

7.  The  similarity  index,  Sg,  is  converted  to  a  distance 
value  by  taking  -log2  S£.  For  identical  patterns,  the 
"distance"  is  zero.  For  highly  dissimilar  patterns,  the 
distance,  De«  is  a  positive  number  exceeding  1,  2,  or 
more . 

Correlation  Coefficient. 

This  is  the  well-known  classical  measure  of  correlation  between 
two  sets  of  observations,  or,  in  the  case  of  pattern  comparisons, 
of  two  patterns  at  a  time,  aligned  so  that  the  peaks  match  in  their 
retention  times  within  some  prescribed  tolerance  limits. 

The  classical  correlation  coefficient  (very  related  =  +1) ; 
unrelated  =  0;  oppositely  related  =  -1)  also  gives  intuitively 
correct  values  for  distances  between  points  in  the  correlation 
space,  but  is  more  difficult  to  compute.  Also,  it  can  give  a 
value  of  +1  for  patterns  in  which  additive  and  multiplicative 
differences  compensate,  while  patterns  are  intuitively  dissimilar. 

In  the  statistical  experiments  with  the  similarity  indexes, 
additional  variations  were:  Tanimoto  index  averaged  over  peaks, 
Ekman  index  summed  over  and  divided  by  the  number  of  pairs.  To 
minimize  the  influence  of  the  quadratics  of  the  correlation 
coefficient,  a  correlation  coefficient  applied  to  normalized  area 
X  transformed  logarithmically 

y  =  log  (x  +  0.02) 
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was  included,  in  addition  to  the  correlation  coefficient  applied 
directly  to  the  normalized  areas*. 

Tne  rationale  behind  the  study  of  the  similarity  indexes  was 
the  possibility  of  developing  a  taxonomy  (classification)  of  the 
patterns.  If  a  third  pattern  has  high  similarity  index  to  each 
of  the  two  patterns  which  also  have  a  high  similarity  index, 
these  three  belong  to  the  same  class.  Otherwise,  a  new  class  can 
be  set  up  and  each  next  pattern  tested  for  belonging  to  one  or 
the  other  class  etc.  Classes  can  also  branch,  cf.  Figure  13. 

Here  pattern  A  has  some  similarity  to  B  (2  of  5  features  match, 

=  0.4) ,  C  has  more  similarity  to  B  (3  of  5  features  match, 

ST  =  0.6)  but  A  and  C  are  completely  dissimilar  (none  of  the 
features  match)**.  Thus  interrelation  between  classes  can  be 
gradually  developed,  and  basically  must  use  r.on-Eucl idian  spaces 
for  hypergeometric  representations  (in  Euclidian  space,  if  A  is 
close  to  B  and  C  is  close  to  B,  A  and  B  cannot  be  infinitely  far 
apart) . 

Although  the  taxonomy  development  using  similarity  indexes 
indicated  that  vapors  from  similar  sources  in  average  tended  to 
produce  patterns  with  higher  similarity  indexes,  the  indexes  per  se 
were  low  indicating  that  much  extraneous  noise  (non-essential 
features)  interfered  with  the  classifications.  Therefore  other 
methods  were  applied. 

Procedure _ B 

In  this  procedure,  a  more  mechanical  approach  was  taken  to 
reduce  the  patterns  to  a  definite  number  of  variables  to  which 


*Durtng  this  stage  of  work,  the  normalized  areas  were  taken  as 
the  areas  divided  by  the  mean  area  of  peaks  in  the  particular 
pattern.  In  later  work,  second  part  of  Procedure  D,  the 
normalization  was  by  dividing  the  peak  areas  by  the  total  area 
of  all  peaks  in  the  respective  pattern. 

**Tanimoto  distance  between  Tanimoto  units  B  and  C  is 
-ln2  (0.6)  =  0.73. 
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then  stepwise  discriminant  analysis  program  was  applied.  The 
chromatograms  (initially  in  terms  of  retention  times  normalized 
to  n-butanol,  later  in  terms  of  Kovats  Indexes)  were  divided  into 
ranges  of  equal  width,  (or  percentage  width),  disregarding  the 
initial  and  the  last  portion  of  the  gas  chromatograms.  Peaks  with 
the  position  of  the  peak  in  the  particular  interval  were  considered 
to  be,  with  their  full  area  totally  in  this  interval.  If  several 
peaks  occurred  in  the  same  range,  their  areas  were  summed  and  the 
sum  became  the  intensity  coefficient  for  this  range  in  the 
respective  gas  chromatogram.  Thus,  a  "variable"  was  the  total 
content  of  organic  substances  eluted  during  the  particular  range, 
with  peaks  in  this  range.  By  increasing  the  width  of  the  ranges, 
the  number  of  range  boundaries  could  be  decreased,  reducing  the 
number  of  cases  when  the  same  substance  eluted  close  to  the 
boundary  could  occur  in  one  or  the  adjoining  cell. 

After  reduction  of  the  chromatograms  to  a  set  of  number  of 
variables  stepwise  discriminant  analysis  was  conducted,  using 
SMD07M  Stepwise  Discriminant  Analysis  Computer  Program,  Version 
of  September  1,  1965,  Health  Sciences  Computing  Facility, 

University  of  California,  Los  Angeles. 

Procedure  C 

This  procedure  also  used  stepwise  discriminant  analysis,  but 
instead  of  selecting  the  variables  by  mechanical  subdivision  into 
ranges,  a  histogram  study  was  conducted  to  find  those  gas 
chromatographic  locations  (in  terms  of  Kovats  Indexes)  in  which 
peaks  most  frequently  occurred  in  the  chemical  signatures  of 
humans.  The  variables  were  then  formed  by  putting  the  center  of 
the  range  at  the  maximum  frequency  point  and  allowing  +  10  Kovats 
Index  units  as  an  estimated  tolerance  for  the  two  boundaries  of 
the  same  range.  Thus,  in  this  procedure,  the  variables  were 
histogram-guided  ranges,  not  necessarily  with  common  boundaries 
scattered  throughout  the  gas  chromatograms.  Again,  areas  of  all 
peaks  with  the  apex  in  the  range  formed  the  intensity  value  for 
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the  particular  variable. 


Procedure  D 

This  procedure  was  a  further  development  of  the  Procedure  C, 
and  differed  simply  by  introduction,  into  the  stepwise  discriminant 
analysis,  of  the  "non-peak"  ranges,  between  the  ranges  where  peaks 
were  frequently  observed  in  humans. 

The  above  various  procedures  were  applied  to  all  peaks,  or, 
in  some  trials,  only  to  peaks  within  certain  peak  size  classes, 
(e.g.,  only  large  peaks,  above  certain  size,  etc.) 

The  stepwise  discriminant  analysis  looks  for  those  features 
which  are  most  significant  in  discriminating  two  or  more  groups 
of  patterns,  and  forms  discriminant  functions  which  are  linear 
equations  of  the  form: 

x  =  a ,  V ,  +  \  vk  +  •  •  • 

Here  a ,  ,  a^  etc.  are  coefficients  derived  by  the  computer 
program,  while  V,,  etc-,  are  the  gas  chromatographic  variables. 
The  V's  were  not  taken  as  the  direct  sums  of  peak  areas  within 
the  particular  gas  chromatographic  range  but  were  logarithmic 
transforms  for  the  corresponding  areas,  so  as  to  give  more 
equitable  weights  to  large  and  small  peaks.  In  the  Procedure  D, 
the  transform  was,  e.g. , 

1132  r 

v34  =  I  [f  +  log  {Ai  +  0,1 
1112 

where  34  is  the  "name"  of  the  range;  1112  and  1132  are  its 
boundaries  in  terms  of  Kovats  Indexes;  is  the  peak  area  of 
a  peak  within  this  range,  taken  in  terms  of  its  area  as  percent 
of  the  sum  of  areas  of  all  peaks  in  the  same  pattern.  If  i  was 
the  only  peak  in  the  range,  the  expression  in  the  square  brackets 
became  V^.  If  there  were  several  peaks  in  the  range,  was 

the  sum,  as  indicated,  of  the  logarithmically  transformed  peak 
areas. 
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Discrimination  means  that  a  function  X,  or  several  functions 
(X , Y)  are  found  such  that  their  values  for  all  patterns  of  one 
class,  (e.g.,  humans)  are  distinctly  different  from  the  values 
for  another  class  (environments) . 

These  functions  can  then  be  used  to  discriminate  one  group 
of  patterns  from  another.  The  same  principle  applies  to 
discrimination  of  several  groups  of  patterns. 

In  the  stepwise  discriminant  analysis  the  success  of  the 
procedure  can  be  judged  by  two  criteria.  If  the  number  of 
variables  used  is  small  compared  with  the  number  of  patterns  used 
(much  less  than  by  a  factor  of  2) ,  and  function  that  separates 
the  patterns  in  two  (or  more)  groups  is  found,  the  probability 
that  this  is  a  valid  discriminant  function  is  high.  The 
probability  increases  as  the  number  of  patterns  is  increased  and 
the  number  of  variables  needed  to  classify  decreases. 

The  other  criterion  is  more  rigorous.  If  the  discriminant 
function  is  obtained  by  using  only  a  part,  e.g.,  one  half  of  all 
of  the  patterns  (each  pattern  used  fully) ,  its  validity  can  be 
tested  using  the  values  of  variables  from  the  other  half  of  the 
patterns.  If  the  other  half  of  the  patterns  is  satisfactorily 
classified  into  its  proper  group,  the  probability  is  high  that 
the  variables  selected  by  the  discriminant  analysis  constitute 
truly  differentiating  pattern  class  features. 

3 ,_9 _ Subject  Selection  and  Preparation 

The  signature  collection  experiments  were  performed  on  human 
subjects  of  both  sexes  and  various  races.  Except  for  two  middle- 
age  Caucasian  males,  all  subjects  were  young,  healthy  specimens. 
Before  the  participation  in  the  experiments,  the  subjects  did  not 
use  any  cosmetic  preparations  except  for  neutral  soap,  and  lived 
on  normal  mixed  diets.  To  expose  the  maximum  area  of  skin  to  air 
flow  the  subjects  were  closed  in  the  cleaned  glass  system  partially 
naked. 

The  selected  human  races  included  American  Caucasian  males 
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and  females,  Indian  males,  American  Negro  subjects,  and  Mexican 
and  Malayan  subjects. 

The  majority  of  the  Caucasian  males  were  students  at  the 
Illinois  Institute  of  Technology  in  Chicago.  The  student  group 
included  two  pairs  of  brothers.  The  rest  of  the  Caucasian 
males  comprised  two  chemists,  one  free-lance  writer  and  one 
chemical  laboratory  technician. 

The  group  of  Caucasian  females  was  composed  of  young  office 
workers  employed  at  IIT  Research  Institute,  house-wives  and  a 
few  high-school  students.  This  group  included  a  mother  and  her 
two  daughters. 

The  Indian  group  consisted  of  young  males  born  in  India  with 
temporary  residence  in  the  United  States.  All  members  of  this 
group  were  students  at  the  Illinois  Institute  of  Technology  in 
Chi cago . 

The  American  Negro  group  included  one  young  male  chemist 
and  two  young  females.  The  male  subject  was  a  husband  of  one 
of  the  females. 

The  last  group  included  a  young  Mexican  house-wife  born  in 
the  United  States  and  a  young  Malayan  male.  The  male  subject  was 
also  a  student  at  the  Illinois  Institute  of  Technology  in  Chicago. 

Other  sources  of  chemical  signatures  consisted  of  kitchen 
and  laboratory  air  and  tropical  vegetation.  The  collection  of 
these  signatures  were  performed  by  means  of  the  portable  air 
sampling  apparatus.  The  kitchen  air  samples  were  obtained  from 
the  IIT  Research  Institute  Cafeteria  kitchen;  also  the  laboratory 
samples  and  machine  shop  samples  were  taken  at  IITRI. 

The  chemical  signatures  from  tropical  vegetation  were 
collected  in  the  Palm  House  of  the  Lincoln  Park  Conservatory  in 
Chicago.  The  collection  environment  included  various  types  of 
palm  trees,  such  as  sentry  palms  (Howea  forsteriana) ; 
fishtail  palms,  a  native  of  India  and  the  Malay  region,  (Carvota 
mitis) ;  and  scattered  through  the  underplanting  of  Fittonia ,  a 
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number  of  ornamental  palms,  such  as  Fiji  Island  fan  palms 
(Eupritchardia  pacifica) ,  silver  palms  (Cqccothri nax  arqentea) , 
and  roebelen  palms  (Phoenix  roebelem) .  There  is  also  a  50-ft 
fiadleleaf  rubber  tree  (Ficus  lyrata)  from  Africa,  a  tapioca 

plant  (Mauihot _ exculenta)  ,  a  fig  tree  (Ficus  carica)  ,  and  the 

common  banana  (Musa  saplentum)  -  The  Conservatory  also  houses 
flowerless  plants  such  as  ferns  and  mosses.  The  temperature  and 
the  humidity  of  the  Palm  House  environment  range  from  60  to  85 °F 
and  from  50  to  75%,  respectively. 
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4.0  RESULTS  AND  DISCUSSION 


4.1  Packed  Column  Data 

The  experimental  data  on  human  chemical  signatures  of  both 
sexes  and  various  races  as  well  as  of  different  environments  are 
tabulated  in  the  form  of  copies  of  computer  printout  in  Appendix  I. 

In  the  tabulation,  each  pattern  is  identified  by  the  proper 
number,  the  signature  source  ("subject":  initials  of  humans), 
the  date  of  signature  collection,  the  total  intensity  and  the 
mean  peak  intensity. 

The  arrangement  of  patterns  follows  approximately  the 
chronological  order  of  signature  collection  experiments  and  is 
numbered  accordingly. 

The  total  intensity  is  defined  as  the  total  area,  in  sq.  mm., 
of  all  gas  chromatographic  peaks  of  the  particular  pattern  and 
the  mean  peak  intensity  as  the  total  intensity  divided  by  the 
number  of  peaks  in  the  same  pattern. 

The  data  on  each  pattern  tabulated  in  Appendix  I  includes 
also  consecutive  numbers  of  peaks  in  the  pattern,  the  retention 
times  of  peaks  in  terms  of  Kovats  '  dexes,  the  peak  areas  in  sq.  mm. 
(calculated  to  the  electrometer  sensitivity  setting  of  0.1  x  1) 
and  the  peak  normalized  areas  (peak  area  divided  by  the  total 
intensity) . 

The  gas  chromatographic  patterns  of  the  sic  latures  studied 
reflect  complex  and  only  partially  resolved  mixtures.  The  number 
of  detectable  components  observed  in  human  signature  patterns 
ranged  from  32  to  48  with  many  indication  for  the  presence  of  a 
large  number  of  additional  unresolved  peaks.  The  magnitude  of  the 
collection  samples  ranged  from  2.6  x  10~^g  for  an  Indian  male  to 

2.1  x  10~5g  for  a  Caucasian  male. 

The  total  number  of  detectable  components  in  the  environment 
chemical  signatures  ranged  from  34  to  46  with  a  strong  evidence 
for  the  presence  of  many  unresolved  peaks.  The  magnitudes  of  the 
collected  samples  fell  within  the  range  of  1.9  x  10“^g  and 

1.1  x  10_5g. 
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it  :..u a l  understood  that  each  peak  in  the  tabulation  can 

represent  more  than  one  substance  and  is  resolvable,  as  later 
work  with  the  two-column  device  showed,  into  several  peaks  when 
examined  in  a  non-polar  Apiezon  L  column. 

The  peak  areas  are  approximately  proportional  to  the  sum  of 
weight  of  the  components  represented  by  the  peak.  Table  XI  lists 
the  relative  amount  of  area  contributed  per  atom  of  the  respective 
element* . 

Since  the  number  of  heteroatoms  compared  with  the  number  of 
carbon  atoms  in  common  compounds  with  reasonable  molecular  size 
is  small,  the  area  of  multicomponent  peaks  can  be  assumed  to  be 
approximately  proportional  to  the  weight  of  material  it 
represents . 

2 

The  peak  areas  in  the  Appendix  are  quoted  in  mm  ,  taken  at 
the  electrometer  attenuation  of  0.1  x  1.  For  the  particular 
instrument,  the  calibration  factor  was  approximately: 

1  mm^  =  0.2  x  10~12g 

The  peak  areas  represent  the  amount  of  the  organic  material 
within  the  peak  range  injected  into  the  chromatograph.  The  amount 
present  in  the  collected  sample  is  estimated,  from  the 
efficiencies  of  sample  transfer  and  injection  to  be  30  percent 
higher. 

The  concentration  of  the  respective  materials  in  air  from 
humans  could  be  calculated  only  if  the  calibration  factor 
(partition  coefficient)  was  available  for  the  particular  molecular 
species:  Table  XII  represents  data  on  the  emission  rates  of 
n-butanol  from  humans.  This  substance  formed  a  recognizable  peak 
used  for  internal  calibration. 


* Johns  &  Sternberg,  Gas  Chromatographic  Detectors,  in 
Instrumentation  in  Gas  Chromatography,  p.  179,  Krugers,  ed.  , 
Centrex  Publishing  Co.,  Eindhoven,  Holland,  1968. 

IIT  RESEARCH  INSTITUTE 


38 


For  unknown  substances,  very  approximate  calibration  is 
possible  provided  that  Kovats  Index  of  the  unknowns  in  Apiezon  L 
is  known  and  taken  to  be,  at  room  collection  temperature,  the 
same  as  at  elevated  temperatures.  The  amount  G  of  a  substance 
X  collected  is 

G  =  k  •  m  •  v 

Here  k  is  the  partition  coefficient  of  the  substance  in 
Apiezon  L/Air  system  at  the  ^jllection  temperature  (ambient) ; 
m  is  the  concentration,  in  grams/cm  ,  of  the  substance  in  air, 
and  v  is  the  volume  of  Apiezon  L  in  the  collector.  A  plot  of 
log  k  versus  the  Kovats  Indexes  of  the  substances  in  Apiezon  L 
is  a  straight  line  and  can  be  used  for  approximate  estimates  of 
k;  v  is  known,  end  m  can  be  calculated  when  G  is  known  from  the 
area  of  the  gas  chroma tog r -phi c  peak.  The  value  of  k  increases 
by  a  factor  of  approximately  2.3  per  each  100  units  of  Kovats 
Index  increase.  The  emission  from  the  human  subject  is  (750  . : )/sec 

3 

when  750  cm  /sec  is  the  rate  of  airflow  through  the  glass  cell 
from  which  the  human  subject  vapors  are  collected. 

These  remarks  are  to  indicate  that  in  the  equilibration 
sampling  (fluidized  bed.  Section  3.2)  approximate  calibrations 
can  be  obtained  even  for  unknown  materials,  from  their  Kovats 
Index  positions  in  Apiezon  L.  Since  the  Apiezon  L  Kovats 
Indexes  were  'Obtain e<3  in  the  two-column  device  (Sections  3.6.3 
and  4.4)  where  the  first  column  was  a  200-ft  open  tubular 
Carbowax  20M  column,  the  areas  under  peak  in  the  packed  Carbowax  20M 
column  cannot  be  directly  related  to  the  high-resolution  runs 
which  re  ulted  from  the  use  of  two-column  device.  The  packed 
column  analyses  were  to  assist  in  finding  those  gas 
chromatographic  regions  where  the  significant  features  reside. 

It  was  more  important  to  concentrate  adequately  all  organic 
vapors  including  those  with  a  low  volatility,  so  than  they  would 
be  detectable  at  feasible  sensitivities  and  to  analyze  these  in 
Carbowax  20M  column  where  better  peak  shapes  are  obtained  for 
all  polar  components,  including  alcohols  which  chromatograph 
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poorly  on  Apiezon  L.  The  use  of  Carbowax  20M  in  the  fluidized  bed 
collectors  was  not  practical  since  the  sample  collected  too  much 
water  which  then  interfered  with  the  analysis. 

4._2_  Pattern  Classification 

The  various  approaches  to  this  task  were  discussed  in  Section 
3.8.  The  objective  of  the  pattern  classification  was  twofold: 

(1)  to  obtain  evidence  that  the  complex  chemical  signatures,  in 
the  form  of  gas  chromatograms  of  vapors  from  humans  and 
environments,  contain  features  useful  for  their  identification  as 
to  the  type  of  their  sources,  and  (2)  to  indicate  those  regions  in 
the  gas  chromatograms  that  are  most  significant  for  the  source 
discrimination  and  therefore  merit  first  attention  in  further 
chemical  identification  of  the  components. 

4.2. 1  Peak  Match inq  Studie s 

Some  pattern  comparisons  were  conducted  using  simple 
rank  order  statistics  on  gross  indexes  such  as  the  number  of  peaks 
per  chemical  signature-  Here  three  more  or  less  homogeneous  groups 
of  humans  on  which  a  sufficient  number  of  chemical  signatures  were 
available  were  used:  12  males  from  India,  25  Caucasian  American 
males  and  23  Caucasian  American  fv^-ios- 

The  average  numbers  of  peak*  and  the  standard  deviations  in 
the  numbers  of  peaks  (in  parentheses)  in  the  chromatograms  submitted 
to  the  peak  matching  procedures  were  as  follows:  Indian  males  41.2 
(1.8);  Caucasian  males  37.3  (2-5);  Caucasian  females  40.0  (4.2); 
environments  37.0  (6.1)  Wil coxon-White  rank  order  test  applied 
to  the  number  of  peaks  per  gas  chromatogram  as  the  ranking  criterion 
reached  99  Dercent  confidence  level  (less  than  0.01  probability  to 
obtain  such  cifferen  es  by  chance)  when  Indian  males  were  compared 
vs  Cauf'-’siar,  tales,  and  when  Caucasian  males  were  compared  vs 
Caucas.an  females;  the  differences  in  the  peak  numbers  between 
Caucasian  feme les  and  Indian  males  did  not  reach  95  percent 
confidence  level. 
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Further  comparisons  included  data  inspection  after  peak 
matching . 

Most  of  the  pattern  comparisons  for  the  purpose  of  obtaining 
similarity  indexes  were  conducted  after  mat  clung  of  peaks  between 
two  patterns  at  a  time  allowing  for  a  +  4  percent  tolerance  of 
retention  time  normal  lieu  to  n-butanoL  after  addition  of  (  ' 

to  the  normalized  retention  time.  The  0.02  term  was  significant 
only  at  the  initial  time  range  of  the  gas  chromatographic  patterns. 

The  selected  matching  criterion  was  apparently  rather  liberal, 
since  the  number  of  peaks  that  could  be  matched  within  each  pair 
of  chemical  signatures  taken  was  m  excess  of  80  percent  of  peaks 
observed  in  each  of  the  patterns  of  the  pair.  To  assess  if  at 
this  tolerance  level  gross  differences  existed  between  members  of 
the  same  group  versus  the  numbers  of  the  same  or  other  groups, 
seven  randomly  selected  patterns  were  chosen  within  each  of  the 
three  groups  and  compared  with  others.  The  following  observations 
resulted:  from  the  average  number  of  32  peaks  in  2  Caucasian  male 
patterns,  85  percent  found  matches  in  the  patterns  of  other 
Caucasian  males;  from  the  average  number  of  40.2  peaks  in  2  Indian 
male  patterns,  82  percent  found  matches  m  the  pattern;  of 
Caucasian  males;  from  the  average  number  of  39.6  peaks  in  7 
Caucasian  female.?.  84  percent  found  matches  in  the  patterns  of 
other  Caucasian  females.  These  initial  observation^  did  not 
indicate  existence  of  gross  differences  in  the  match  frequencies 
within  the  group  and  between  the  groups. 

The  relatively  high  density  of  the  coverage  of  the  gas 
chromatograms  with  peaks  is  one  of  the  factors  which  may  account 
for  this  initial  result.  If  the  allowed  match  tolerance  is  too 
broad,  there  wi.l’  always  a  high  probability  of  finding  a 
matching  peak.  tolerance  is  too  narrow,  experimental 

variability  of  th  .cention  times  for  the  identical  substance 
may  be  large  enough  to  miss  the  tolerance  limit  and  give  "no 
match"  although  actually  a  perfect  chemical  identity  match 
existed. 
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Consequently,  more  elaborate  pattern  similarity  indexes  were 
explored  in  which  consideration  was  given  not  only  to  match 
versus  nonmatch  in  the  peak  positions  but  also  to  the  magnitudes 
of  the  chromatographic  areas  which  represent  the  abundancies  of 
the  components  eluted  m  the  respective  gas  chromatographic  range. 

The  merits  of  two  similarity  measures--Tanimoto  distance  and 
the  correlation  coefficient  (cf.  Section  3 . 8 ) -••  applied  to  patterns 
matched  to  4  percent  tolerances  were  explored  on  a  part  of  the  data 
available  at  the  time  when  the  computerized  GASCHR  program  became 
operational . 

The  Tanimoto  distance  m  a  non-par ametric  hyperspace  is  zero 
for  two  identical  patterns,  large  for  dissimilar,  and  infinite 
for  completely  non-matching  patterns  in  which  no  peaks  in  one 
match  the  other  pattern's  peaks  m  retention  times  within 

j 

prescribed  tolerances. 

The  test  group  consisted  of  the  chemical  signatures  of  12 
Indian  males,  10  Caucasian  females,  and  9  environments,  (2  plants, 

2  laboratories,  and  5  kitchens).  The  results  indicated  that 
considerably  variations  occurred  within  each  group  and  that  very 
few  Tanimoto  distances  were  below  1.0.  However,  the  average 
distances  for  pairs  within  each  group  and  for  mtergroup  pairs 
were  consistent,  and  patterns  for  humans  were  clustered,  in 
comparison  to  the  average  Tanimoto  distances  between  chemical 
signatures  of  humans  and  environments. 

Indian  males  versus  Indian  males  1.561  (66  pairs) 

Caucasian  females  versus  Caucasian  females  1.817  (45  pairs) 

Indian  males  versus  Caucasian  females  1.809  (120  pairs) 

Indian  males  versus  environments  2,094  (108  pairs) 

Caucasian  females  versus  environments  2.024  (90  pairs) 

This  tabulation  shows  that  (1)  the  Indian  male  group  was  most 
consistent  within  itself  (2)  the  variability  among  Caucasian  females 
was  larger  and  approximately  as  large  as  between  Indian  males  and 
Caucasian  females,  and  (3)  the  difference  between  human  and  the 
environmental  chemical  signatures  was  larger. 

Correlation  coefficients  applied  to  the  logarithmically 
transformed  peak  areas  after  4  percent  tolerance  match  showed 
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similar  relation.-.  The  latge-t  cohecivity  occurred  within  the 
Indian  male  group. 

When  pattern-  were  a-  igned  to  human  versus  environment 
groups  by  comparing  the  average  Tanimoto  distances  from  the 
potter n  in  question  to  2c  human  pattern;  and  to  9  environment 
pattern;  correct  alignment  resulted.  Pattern  assignment  to 
certain  group;  of  humans  on  the  same  comparison  basis  was 
inconsistent.  It  war  concluded  that  if  the  Tanimoto  distance  or 
a  similar  concept  is  used,  the  number  of  comparison  samples  should 
be  increased,  or  some  pr  e- cL  as.-  if  1  cat  ion  of  peaks  into  useful 
and  nonuseful  should  be  conducted. 

The  similarity  indexes  for  patterns  within  the  same  group 
were  relatively  .-mall  -  O.J  to  0.5,  rarely  reaching  above  0.5. 

Many  mismatches  in  sizes  and  appearances  of  spurious  peaks 
corresponding  to  occasionally  occurring  components  in  vapors 
from  humans  lower  the  index  with  the  result  that  many  Tanimoto 
di  st antes  were  above  unit y  and  were  not  good  classifers. 

Thus,  the  pattern  cl  as 1 1 1  cat  ion  approaches  based  on 
similarity  indexes  did  indicate  the  possibility  of  classification. 
However,  it  was  felt  that  discriminant  analyses  where  the  emphasis 
is  not  on  the  measure.-  of  similarity  but  on  the  features  most 
useful  in  class  .-.epai  at  i  on--,  may  be  a  more  powerful  tool  for  the 
cases  when  the  classes  to  which  the  various  patterns  belonged 
were  already  defined  m  terms  of  the  vapor  sources.  Similarity 
indexes  are  more  specifically  suited  for  developing  taxonomies 
when  the  observations  are  not  easily  assignable  to  classes,  but 
dasses  and  subclasses  have  to  be  uncovered  m  poorly  structured 
dat  a  . 

Consequently,  later  work  was  concentrated  primarily  on  using 
discriminant  analysis  as  the  classification  tool. 

4_  2^  2  5 t  epwisfc  i)i.-n  nrnnant  Ana  lyses.- Ar  bit  r  ary  Va_ri_ab_le 

For  the  -fepwi-e  discriminant  analyse--  (cf.  Section  5.8) 
when  the  chemical  identity  of  the  components  is  uncertain  it  is 
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necessary  to  subdivide  the  gas  chromatographic  patterns  into  a 
fixed  number  of  ranges  serving  as  variables  (vectors) ,  so  that 
each  pattern  can  be  expressed  in  terms  of  the  same  number  of 
variables.  If  each  range  contains  not  more  than  one  peak,  the 
intensity  value  for  the  particular  variable  can  be  the  normalized 
area,  or  some  function  of  it.  of  the  peak.  Experience  indicated 
that  if  the  ranges  are  small  enough  to  contain  either  none  or 
very  infrequently  more  than  one  peak,  many  cross-overs  can  occur 
m  a  certain  range,  or  in  the  neighboring  range  while  representing 
the  same  component,  since  the  position  of  a  peak  can  shift  by 
several  reasons  (cf.  Section  3.8). 

Initially,  65  ranges  were  set  up,  with  the  width  of  each 
range  corresponding  to  6  percent  of  the  retention  time 
corresponding  to  the  ranges  lower  retention  time  boundary;  e.g., 
for  the  range  beginning  at  10.0  minutes  the  boundaries  would  be 
10.0  and  10.6  minutes,  respectively.  This  partitioning  resulted 
m  65  ranges.  After  preliminary  runs,  with  49  patterns  available 
at  that  time  in  which  the  retention  times  were  normalized  with 
respect  to  the  n-butanol  retention  times,  21  adjoining  (4  with 
retention  times  smaller  than  that  of  butanol  and  16  with  retention 
times  larger  than  that  including  the  butanol)  were  selected,  since 
( 1)  the  number  of  patterns  presented  for  analysis  must  exceed  the 
number  of  variables,  (2)  the  retention  times  of  peaks  around  the 
butanol  peak  were  more  reliable  when  relative  to  butanol,  and 
(3)  preliminary  analysis  showed  that  the  majority  of  peaks  useful 
for  distinguishing  various  types  of  humans  seemed  to  be  in  this 
i egion . 

By  using  a  correlation  coefficient  matrix,  the  statistical 
process  extracted  successive  significant  variables .  beginning  with 
those  which  produce  the  most  efficient  separation  between 
prescribed  groups  of  patterns.  It  continued  to  extract  the 
variables  (peaks)  in  order  of  their  decreasing  significance.  After 
each  variable  had  been  handled  for  all  patterns,  the  efficiency 
of  classi f l cation  based  on  all  peaks  extracted  up  to  this  point 
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was  evaluated.  This  process  was  applied  to  the  49  patterns 
grouped  in  5  groups:  20  Caucasian  males,  12  Indian  males,  10 
Caucasian  females,  5  various  humans  of  other  types,  and  2  tropical 
plants . 

To  compare  the  efficiency  of  the  classif ication  process  by 
these  logical  groups,  a  random  classification  of  the  same  49 
patterns  was  also  subjected  to  analysis.  Figure  14  shows  that 
after  classifying  13  variables,  only  3  patterns,  or  less  than 
9%,  were  misclassif ied  when  proper  groups  were  used  and  that  15 
patterns,  or  over  30%,  were  mi sclassif ied  in  the  randomized 
groups.  The  partial  classif ication  structure  in  the  randomized 
group  arises  from  the  statistical  probability  that  a  pattern  can 
be  assigned  to  the  proper  groups  by  chance  and  from  the 
presence  of  possible  other  classifiable  features  that  can 
occur  in  the  gas  chromatograms. 

Subsequently,  the  range  of  the  gas  chromatogram  included  in 
patterns  for  the  stepwise  discriminant  analysis  was  expanded  and 
the  width  of  each  range  representing  separate  variables  was 
doubled.  The  purpose  of  this  change  was  to  include  more 
information  and  reduce  the  probability,  for  the  peaks  close  to 
a  range's  boundary,  to  cross  over  the  boundary  into  the  adjoining 
cell.  The  range  of  retention  times  between  0.327  and  5.73  (relative 
to  n-butanol)  was  subdivided  in  21  ranges,  with  n-butanol  being 
in  the  8th  range.  The  rear  boundary  of  each  range  was  equal  to 
the  front  boundary's  retention  time  plus  0.02  plus  12  percent 
of  the  front  boundary's  retention  time.  The  intensity  of  the 
range  was  equal  to  che  sum  of  areas  of  all  peaks  having  apexes 
within  the  range;  the  actual  intensity  was  transformed 
logarithmically,  with  conventional  linear  increments,  to  form  the 
intensity  vector  for  the  discriminant  analysis. 

At  that  stage,  three  statistical  experiments  were  conducted. 

In  one,  discrimin  nt  functions  were  formed  using  27  human  chemical 
signatures  and  5  chemical  signatures  of  air  from  a  cafeteria  kitchen 
area  taken  at  different  times.  The  separation  in  terms  of  two 
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discriminant  functions,  X  and  Y,  after  computer  use  of  6  out  of 
21  variables,  is  shown  m  the  top  part  of  Figure  15.  The  derived 
discriminant  functions  were  used  to  compute  X  and  Y  for  additional 
24  human  chemical  signatures.  The  values  obtained  were  plotted 
over  the  repeat  plot  of  the  top  portion  of  Figure  15,  in  the  bottom 
part  of  Figure  15.  Reasonably  successful  classification  of 
additional  signatures  was  observed  with  15  out  of  24  falling  into 
the  "range  circle1  somewhat  arbiranly  drawn,  and  several  more 
closer  to  the  center  of  gravity  for  the  human  patterns  than  to 
the  cooking  vapor  pattern  center. 

The  second  experiment  is  represented  in  Figure  16.  Here  37 
chemical  signatures  were  classified  in  three  groups:  20  Caucasian 
males,  12  Indian  males  (students  from  India,  mostly  vegetarians)  and 
again  5  cooking  vapor  samples,  using  up  to  14  variables. 

The  third  experiment  consisted  of  classifying  chemical 
signatures  ir,  two  groups:  16  females  and  35  males.  The  classification 
after  using  7  variables  is  shown  in  Figure  17.  This  experiment 
i;  significant  since  here  only  the  chemical  signatures  obtained 
m  the  glass  cell  apparatus  ( cf .  Section  3-1)  were  used,  and  the 
possible  question  on  discriminant  features  introduced  by  sampling 
hum, an s  in  the  cell  versus  sampling  environments  in  air  at  large 
does  not  arise. 

At  this  stage  of  the  work,  normalization  of  the  retention 
times  in  terms  of  Kovats  Indexes  was  introduced  (cf.  Section  3.6.5) 
and  it  appeared  desirable  to  subdivide  the  chromatogram  into  Kovats 
Index  increments  for  the  purpose  of  the  stepwise  discriminant 
analysis.  This  index  is  a  logarithmic  function  of  the  retention 
times ,  hence  equal  index  intervals  approximate  equal  retention 
time  percentage  increments. 

One  set  of  the  statistical  tests  was  conducted  using  arbitrary 
division  of  the  chromatogram  into  ranges  50  Kovats  Index  units  wide. 
Thus,  22  ranges  were  set  up:  one  with  K.I.  below  600,  then  20  ranges 
each  50  units  wide,  and  the  last  range  with  K.I.  above  1600.  This 
converts  each  chromatogram  into  a  set  of  22  variables;  the  intensity 
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of  each  variable  is  equal  to  the  sum  of  logarithmic  transforms 
of  all  peak  areas  with  the  apexes  in  the  given  range.  Two 
discriminant  analyses  were  conducted.  One  used  three  groups: 

39  males,  26  females,  and  10  environments.  The  other  used  human 
signatures,  obtained  m  the  glass  cell  only,  and  into  3  groups: 

25  Caucasian  males,  12  Indian  males,  and  26  Caucasian  females, 

63  humans  in  total.  Tables  XIII  and  XIV  illustrate  the  success 
of  classifications,  in  terms  of  percent  of  patterns  properly 
assigned  to  the  correct  group.  Discriminant  functions  based  on 
13  variables  produced  more  than  80  percent  of  correct  assignments 
for  the  test  containing  63  human  chemical  signatures.  Most  of 
the  classification  was  achieved  at  taking  5  variables  only.  It 
can  be  noted  that  sometimes  upon  inclusion  of  further  variables 
( cf .  variables  Nos.  3  and  4,  female  column  Table  XIII)  the 
percentage  of  correct  assignments  can  decrease.  This  effect 
results  from  compromises  when  optimization  of  group  separations 
leads  to  a  larger  deviation  from  the  group  center  point  for  an 
occasional  case. 

All  these  experiments  conducted  by  mechanical  separation  of 
gas  chromatograms  into  range  variables  indicated  that  discriminating 
features  can  be  found  and  reasonably  successful  functions  formed 
after  using  only  a  fraction  of  the  variables,  but  that  5  to  15 
variables  may  be  needed  to  sufficiently  define  the  chemical 
signature  source  if  the  identification  is  based  on  vapor  gas 
chromatograms  consisting  of  30  to  50  peaks. 

At  this  juncture  it  was  decided  to  explore  if  a  more 
selective  choice  of  variables  (gas  chromatographic  ranges)  can 
result  in  better  and  more  valid  classifications,  or  help  to 
pinpoint,  for  the  future  gas  chromatographic  and  mass  spectro- 
graphic  identification  those  narrower  ranges  where  significant 
discriminating  features  reside. 
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_ Stepwise  Discriminant  Analysis  Wilh  Selv.ctiv.Lty_ 

_ Chosen  Var 1 ables 

In  this  part  of  the  statistical  experiments,  attempts 
were  made  to  use  several  rationales  in  selecting  gas 
chromatographic  ranges  to  serve  as  variables  in  the  input  to  the 
stepwise  discriminant  analysis  program,  instead  of  mechanically 
dividing  the  entire  gas  chromatographic  range  into  a  set  of 
adjoining  ranges,  as  was  done  in  the  work  inscribed  in  Section  4.2.2. 

Histogram  Procedure.  Here  data  on  the  positions  of  gas 
chromatographic  peaks  expressed  m  Kovats  Index  units  were  pooled 
for  all  65  patterns  of  humans  together,  or  for  males  and  females 
separately  and  arranged  in  the  order  of  increasing  Kovats  Index. 
Wherever  peaks  occurred  frequently  m  the  human  chemical 
signatures,  the  number  of  peaks,  e.g.,  per  10  Kovats  Index  units 
was  higher  than  elsewhere.  Figure  18  represents  the  histograms 
of  the  frequency  of  occurrence  of  peaks  in  Caucasian  males  and 
females  along  the  Kovats  Index  scale. 

These  histograms  can  be  considered  as  generalized  gas 
chromatographic  patterns,  with  the  peaks  indicating  those  ranges 
where  peaks  of  components  most  commonly  occur,-  the  peak  areas 
of  the  original  gas  chromatograms  are  not  indicated  by  this  plot, 
except  that  only  those  original  peaks  were  considered  (about  2/3 
of  all  peaks)  which  were  larger  than  40  percent  of  the  average 
area  of  peaks  m  the  corresponding  pattern.  There  are  striking 
similarities  between  the  occurrence  of  peaks  with  the  f  Kovats 
Index  values  m  the  two  generalized  patterns,  but  also  distinct 
differences;  for  instance,  substance  with  K.I.  =  1030  (asterisk) 
much  more  frequently  appears  at  larger  relative  concentration  in 
effluents  from  females  than  from  males.  The  peak  areas  are  not 
considered  in  the  histograms. 

More  histograms  were  prepared  using  an  available  computer 
program  in  which  the  pooled  and  ordered  Kovats  Index  array  is 
scanned,  taking  each  peak  l  in  turn  and  calculating  the  Kovats 
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Index  interval  between  two  peaks  .,the  first  of  which  is,  e.g.,  10 
peaks  before  i  and  the  second  is  10  peaks  after  i.  Wherever  this 
interval  is  smaller,  the  peaks  occur  more  frequently.  An  example 
of  a  part  of  such  printout,  which  is  in  essence  reciprocal  to 
Figure  18,  is  showm  in  Figure  19.  The  points  of  minima  represent 
those  locations  on  the  Kovats  Index  scale  where  peaks  occurred 
more  frequently 

Selection  of  Variables.  By  inspection  of  a  histogram  for 
the  entire  gas  chromatographic  range  of  65  pooled  human  chemical 
signatures,  37  locations  were  found  where  the  peaks  occurred 
most  frequently,  that  is  the  minima  in  the  histogram  such  as  shown 
in  Figure  19  were  most  pronounced.  The  variables  were  set  to 
center  on  the  minima  and  to  have  a  range  of  +  10  Kovats  Index 
units  around  the  selected  minima.  All  peaks  with  apexes  in  the 
K.I.  range  were  considered  to  be  fully  within  the  range.  Thus, 
the  chromatograms  were  each  converted  to  37  variables;  cf .  Section 
3.8  for  the  intensity  transformation  procedure  used  to  assign 
the  intensities  to  the  variables.  If  the  center  points  of  two 
adjoining  ranges  were  closer  than  20  K.I.  units,  the  boundary 
between  the  ranges  was  set  at  the  middle  point  between  t  he  centers 
'’’his  selection  left  15  to  20  possible  high  frequency-of -occurrence 
ranges  not  included. 

The  37  ranges,  called  P-ranges  (peak-containing)  were 
adjacent  at  some  locations,  but  separated  by  ranges  in  which 
peaks  in  the  chemical  signatures  of  humans  were  less  common. 

Table  XV  lists  all  types  of  ranges,  consequently  numbered,  with 
indication  of  the  range  widths  and  the  type  of  range  (P  or  non-P) . 
The  statistical  discriminant  analyses  were  conducted  using  various 
choices  of  variable  sets.  Thus,  all  37  P  ranges,  or  10  of  P-range 
or  the  non-P  ranges  were  tried;  finally,  34  variables  were 
selected,  shown  in  the  last  column  of  Table  XV,  wnich  included 
18  P-renges  shown  to  be  most  discriminating  between  human  ana 
environment  chemical  signatures  and  16  non-P  ranges,  also  shown, 
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among  the  r.on-P  ranges,  to  be  most  discriminating.  The 
consideration  of  nor.-P  ranges  was  made  upon  the  suggestion  of 
personnel  of  the  Ballistics  Research  Laboratory  who  pointed  out 
that  the  pattern  discrimination  should  be  aided  by  considering, 
in  addition  to  the  presence  of  certain  features,  also  the  Lack 
of  other  features. 

Stepwise  Discriminant  Analyses.  Selected  typical  experiments 
are  summarized  here. 

Experiment  1 .  Discrimination  of  65  Human  from  10  Environment. 
Chemical  Signatures  Using  P-Rar.ges  Only.  Table  XVI  summarizes  the 
efficiency  of  classif ication , 

In  run  A,  all  37  variables  were  submitted;  m  run  B,  those 
ten  variables  ranges  were  selected  from  the  37  ranges  where  the 
peaks  most  frequently  occurred  in  human  chemical  signatures.  It 
is  seer,  that  in  test  B,  the  computer  needed  to  use  only  five 
ranges  as  variables  to  separate  91  percent  of  the  patterns  from 
humans  into  one  coherent  group.  In  test  A,  94  percent  coherence 
was  accomplished  after  the  use  of  11  ranges  out  fo  37,  with  no 
improvement  beyond  that.  After  the  extraction  of  17  variables  out 
of  37,  the  computer  reported  that  no  information  that  would  be 
statistically  useful  for  group  separation  was  detectable  in  the 
remaining  20  variables.  The  degree  of  coherence  in  classification 
reached  by  this  technique  of  the  selection  of  variables  was 
considerably  better  than  that  obtained  by  a  division  of  the 
patterns  into  ordert  ranges  without  regard  for  the  inner  logic 
of  the  patterns.  Or.  the  basis  of  data  presented  m  Table  XVI. 
component  identification  can  be  centered  on  the  most  significant 
ranges.  It  must  be  noted  that  the  important  ranges  can  include 
several  peaks  so  that  a  decision  must  be  reached,  by  the  use  of 
higher  resolution  columns,  as  to  which  are  the  important  peaks 
within  the  important  range. 
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of  Human  Versus 


Experiment  2 .  Discriminant  Analysis 
Environment  Chemical  Signatures  Using  P  and  r.on-P  Ranges  .  Here 
two  runs  were  made.  In  Run  C,  36  human  chemical  signatures  were 
selected  at  random  from  65  available  signatures,  and  alL  10 
environment  chemical  signatures;  the  discriminant  analysis  uas 
conducted  using  all  P-ranges  only  (marked  P  in  the  "Range  Width" 
column  of  Table  XV) .  In  Run  D,  the  discriminant  analysis  was 
conducted  using  all  other,  non-P-ranges  only.  Figure  20  compares 
the  relative  efficiency  of  both  selections. 

Classifications  based  on  either  set  of  variables  were 
eventually  comparably  successful.  The  classification  based  on 
those  Kovats  Index  ranges  m  which  peaks  were  frequently  observed 
m  samples  from  humans  was  more  efficient  when  a  smaller  number 
of  variables  was  utilized. 

Experiment  3 .  Derivation  of _ Discriminant  Function  Using 

Part  of  Data  and  Test  of  the  Function  Using  the  Rest  of  Data . 

Here  a  new  selection  of  the  variables  (34  m  total)  was  made. 

It  included  the  most  statistically  significant  variables  both 
from  the  Kovats  Index  ranges  m  which  peaks  frequently  were 
observed  in  the  vapors  from  humans  (P-ranges)  and  the  intervals 
between  these  ranges.  The  selected  variable;  are  marked  Ly 
crosses  m  Table  XV  with  the  P-ranges  also  marked  with  F.  Two 
runs  were  selected  for  the  report. 

In  Run  E,  36  human  chemical  signatures,  randomly  selected 
from  a  total  of  65  available,  and  all  10  environment  chemical 
signatures  were  used  to  derive  the  discriminant  function.  Then 
the  same  discriminant  function  was  appLied  to  classify  the 
remaining  29  human  chemical  signatures.  Figure  21  shows  that  only 
4  out  of  29  or  14  percent  were  misclassif led  if  the  discriminant 
function  based  on  15  variables  (computer-selected  from  34)  was  used. 
The  variables  were,  cf .  Table  XV,  Nos.  18,  30,  35,  37,  44,  47,  49, 
54,  55,  56,  57,  59,  61,  63,  and  65. 

If  the  discriminant  function  derivation  was  stopped  at  a 
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fewer  than  15  variables,  class  separations  were  somewhat  poorer: 

Percent  Correctly 

Variables  Classified  in  Testing 

Included  The  Discriminant  Function 

2  71 

5  74 

15  86 

In  Run  F,  a  three-way  classification  was  tried.  The 
discriminant  functions  X  and  Y  were  derived  for  chemical 
signatures  of  26  females  of  several  races,  22  males  of  several 
races,  and  10  environment  samples,  or  58  patterns  m  total?  15 
additional  male  chemical  signatures  were  kept  if  the  additional 
males  were  correctly  classified  by  the  derived  function. 

The  male  group  used  for  derivation  consisted  of  14  Caucasian 
and  8  Indian  males;  the  male  group  withheld  for  testing  contained 
an  additional  11  Caucasian,  2  Negro,  and  4  Indian  males.  The 
progress  of  classification  dur'ng  the  function  derivation  is 
shown  in  Figure  22.  After  use  of  20  variables  out  of  34 
variables  available  to  the  computer,  the  statistical  noise  level 
was  reached,  with  4  females  still  not  correctly  classified. 

Figure  23  shows  the  classification  field  in  terms  of  two 
discriminant  functions  X  and  Y.  The  hollow  triangles  represent 
the  m^lc  chemical  =  ignat ~  used  in  derivat.cn.  The  fi31.ec 
triangles  represent  the  male  chemical  signatures  used  to  test 
the  functions.  Although  an  intrusion  into  the  environment  ("Airs") 
chemical  signatures  region  occurs,  the  test  points  all  are  far 
away  from  the  region  of  the  field  populated  by  the  female  chemical 
signatures.  This  experiment  again  indicates  that  classifiable 
features  exist  in  the  chemical  signatures  of  males  and  females. 

Other  Statistical  Experiments.  A  number  of  other  class 
separations  was  tried,  e.g.,  Caucasian  males  versus  Caucasian 
females  versus  Indian  males,  and  some  four-and  five-class 
separations.  None  of  these  was  100  percent  successful,  but 
varying  lower  percentages  of  success  were  achieved. 
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The  pattern  stepwise  discr lminant  analysis  led  to  the 
conclusion  that  under  conditions  where  the  chemical  signatures 
are  expressed  in  terms  of  up  to  30  variables,  reasonable 
classifications  by  the  classes  of  sources  (sex.  race,  humans 
versus  environments)  becomes  possible  with  use  of  2  to  20 
variables.  The  required  number  depends  on  the  similarity  between 
the  sources;  in  the  case  of  humans  versus  environments  as  few 
as  two  to  five  variables  suffice  to  effect  80  percent  levels  of 
success.  The  derived  discriminant  functions  held  reasonably 
well  under  tests  with  additional  chemical  signature  patterns. 

The  results  also  indicated  those  Kcvats  Index  ranges  which 
contain  components  most  useful  for  the  source  discrimination  and 
therefore  most  suitable  targets  for  further  chemical 
identification , 

4^3 _ Three-.  Column .  Data 

The  three -column  gas  chromatographic  device  (Section  364) 
produces  readout  which  can  be  analyzed  in  a  two- fold  manner. 

Three -Point  Match  For  the  compounds  for  which  the  three 
time  points  of  exit  (from  the  200  ft.  Carbowax  20M  column  anc 
subsequently  the  short  and  the  long  Apiezor.  L  columns)  have  been 
established .  an  inspection  of  the  three  gas  chromatographic 
recording  tracks  permits  judging  if  these  compounds  are  present 
in  the  sample  of  chemical  signature-  As  in  any  judgement  based 
on  gas  chromatographic  retention  tim.es  in  several  stationary 
phases-  there  are  two  types  of  errors  possible:  (1)  the  location 
of  the  peak  of  a  known  compound  can  be  shifted  because  of  other 
materials  present  .  so  that  a  slight  non  match  does  not  rule  out 
the  presence  of  the  compound  and  (2)  other  compounds  can  produce 
peaks  m  closely  similar  locations  so  that  a  slightly  non -matching 
point  can  be  erroneously  taken  to  represent  the  particular  compound. 
In  the  case  of  a  three-column  device  with  a  continuous  flow  of 
the  components  through  the  system,  additional  erroneous  judgements 
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are  possible,  "Coincidence  ghosts"  can  arise  if  r,  three-point 
match,  or  near-match,  results  from  simultaneous  presence  of 
several  cor.ponent s :  one  component  may  leave  the  Carbowax  20M 
column  at  the  same  time  as  the  known  compound,  while  other 
components  which  left  this  column  at  different  times  can  leave 
the  Apiezon  L  columns  at  the  time  points  expected  for  the  known 
compound.  These  errors  can  be  reduced  if  the  known  compound  can 
be  added  to  a  duplicate  sample  and  the  sample  reanalyzed?  if  the 
peak  in  all  three  points  shows  no  indication  of  broadening  or 
splitting,  the  evidence  that  the  known  is  present  in  the  unknown 
is  very  considerably  strengthened,  while  if  the  peak  broadens  or 
splits,  errors  of  coincidence  or  presence  of  additional  gas 
chromatographically  similar  components  are  likely. 

In  Kovats  Index  system,  approximate  prediction  of  the  gas 
chromatographic  positions  of  compounds  is  possible  from  rules 
operative  in  this  system,  (cf.  Section  3.6.5). 

Table  XVII  lists  some  three-point  matches  obtained  by 
comparisons  of  chemical  signatures  of  humans  with  the  three-point 
locations  of  known  compounds  run  through  the  three-column  device 
independently . 

The  three-point  process,  if  it  is  to  be  utilized  either  for 
the  components  pre-identification  or  *  s  a  pact  of  pattern  analysis, 
n.uet  be  studied  to  establish:  (1)  the  experimental  reproducibility 
of  the  positions  of  the  same  compounds  in  the  context  of  various 
mixtures,  (2)  the  probability  of  occurrence  of  other  compounds 
that  can  produce,  each  completely  by  itself,  similar  sets  of  three 
time  points,  and  (3)  the  probability  of  coincidental  production 
of  a  similar  three-point  set,  by  two  or  three  different  component s. 

Computerized  Indexing,  Here,  peaks  are  matched  without 
reference  to  known  compound  points,  except  that  an  n-alkane 
calibration  scale  is  used  to  convert  the  readings  to  Kovats  Indexes, 
A  program  GA3C0L,  was  written  for  IBM  360  computer  to  process  the 
three-column  gas  chromatographic  data  and  print  cut  the  most 
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probable  sets  of  Kovats  Indexes  in  Carbowax  20M  and  Apiezon  L  for 
the  components  indicated  by  peaks. 

Each  peak  emerging  from  Carbowax  20M  column  enters  both 
Apiezon  L  columns  simultaneously  The  ratio  of  the  corrected 
retention  times  on  the  longs  short  Apiezon  L  columns  is 
approximately  constant.  After  calculating  this  ratio  for  each 
Carbowax  20M  peak  as  the  possible  origin  and  for  the  various 
combinations  of  Apiezon  L  peaks  within  the  gas  chromatographically 
feasible  retention  time  range  and  applying  the  ratio  +  tolerance 
criterion,  the  computer  prints  out  those  peak  sets  that  meet  the 
criterion,  with  the  Kovats  Index  data  calculated  on  the  basis  of 
calibration  with  n-alkanes. 

The  ratio  was  explored  using  known  compounds.  Its  average 
was  2.05.  The  critical  point  is  the  tolerance  allowed.  If  the 
tolerance  is  too  small,  some  components  actually  present  m  the 
sample  can  exhibxt  ratio  which  because  of  normal  experimental 
variations  would  fee  outside  the  limits  and  would  not  appear  in 
the  tabulation.  If  the  tolerance  is  too  broad  the  frequency 
of  mathematical  coincidences  meeting  the  limits  will  begin  to 
increase.  On  the  basis  of  preliminary  experiments  tolerance  of 
+0.10  was  judged  reasonable.  The  computer  program  can  be  run 
with  any  selected  tolerance. 

Figure  24  illustrates  the  data  obtained  by  the  computerized 
analysis  of  r'ne  three-column  gas  chromatograms  of  vapors  collected 
from  the  same  subject  simultaneously.  The  figure  combines  two 
analyses,  at  90°  and  150 °C.  It  indicates  the  utter  complexity 
of  the  composition  and  the  possible  presence  of  groups  of 
compounds  indicated  in  the  upper  right  corner. 

Later  work  with  the  two-column  system  (Sections  3.6,3  and 
4.3)  confirmed  the  complexity  of  peaks  eluted  from  the  200-ft 
Carbowax  20M  column.  Since  the  two -column  data  were  easier  to 
interpret,  and  a  more  detailed  study  was  desired  m  the  gas 
chromatographic  ranges  that  were  significant  in  pattern 
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classification  (cf.  Section  4.2.2  and  4.2.3) ,  the  work  with  the 
three-column  system  was  set  aside  in  favor  of  that  with  the  two- 
column  system.  The  limited  experience  with  the  three-column 
system  operated  isothermally  led  to  *rhe  following  observations: 

(1)  the  system,  in  principle  can  be  useful  for  a  rapid  overview 
of  new  patterns  (2)  since  small  errors  in  short  retention  times 
strongly  influence  the  ratios  of  the  retention  times  .  the  matching 
of  peaks  at  the  beginning  of  the  gas  chromatograms ,  and  for  polar 
components  with  short  retention  m  Apiezon  L  throughout  the 
chromatogram ,  is  less  certain,  so  that  components  actually  present 
can  be  missed  if  tolerances  there  are  set  too  narrow,  (3)  towards 
the  end  of  gas  chromatograms,  resolution  increases  to  the  extent 
that  several  peak  combinations  can  reasonably  meet  the  matching 
criterion . 

It  is  evident  that  the  ease  of  interpretation  of  the  three- 
column  data  significantly  depends  on  the  number  of  components 
present  in  the  sample  and  on  the  resolution  of  columns  used. 

Human  chemical  signatures  exhibit  complexity  that  limits  a 
straightforward  interpretation  of  isothermal  analyses. 

4 ..  4  Two- Co lumn _Da t a 

The  data  on  tentative  identification  of  selected  components 
of  human  chemical  signature  acquired  by  means  of  two-column 
apparatus  are  displayed  in  Table  XVIII.  The  data  were  derived 
from  the  plot  in  Figure  25  The  Carbowax  20M  peaks  examined  by 
this  technique  were  arranged  m  order  of  increasing  Kovats 
Retention  Indexes  and  numbered  accordingly.  The  selection  of 
peaks  to  be  examined  wa.-  based  on  Kovats  Index  Ranges  isolated 
by  the  computer  program  The  ranges  comprised  peaks  with 
maximum  occurrence  frequency  m  chemical  signatures  of  oil 
subjects  previously  studied.  The  computer- isolated  ranges 
together  wit.\  the  number  of  peaks  observed  in  the  ranges  are 
given  in  the  second  column  of  Table  XVIII.  Except  for  Carbowax  20M 
peaks  1  and  3,  all  peaks  examined  fall  within  the  selected  ranges; 
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peak  1  preceeds  the  range  by  only  7  Kovats  Index  units  and  peak 
3  by  12  units. 

Since  the  pattern  analysis  was  based  on  gas  chromatographic 
data  obtained  with  packed  partition  column  while  Table  XVIII  is 
based  on  data  from  the  200-ft  open  tubular  column  with  much 
higher  resolution,  an  exact  correspondence  between  peak  positions 
in  the  two  sets  of  data  is  not  to  be  expected. 

The  twenty-five  parent  peaks  selected  in  elution  from  the 
first  (200-ft  Carbowax  20M)  column  resolved,  as  Table  XVIII 
indicates,  into  over  150  components  on  re-analysis  in  the  50-ft 
Apiezon  L  column.  The  number  of  components  per  peak  ranged  from 
3  to  10.  Contribution  of  these  components  to  the  parent  peak 
varied  over  a  broad  range,  but  usually  one  to  three  daughter 
components  could  account  for  the  most  of  the  parent  peak  area. 

Thus,  the  25  parent  peaks  resolved  in  the  second  (Apiezon  L) 
column,  into  36  major  components  and  approximately  120  minor 
components.  The  major  components  are  marked  by  asterisks.  Some 
parent  peaks,  e.g. ,  one  with  K.l.=154Q,  resolved  into  several 
components  none  of  which  relatively  large  with  respect  to  the 
others,  so  that  none  of  the  daughter  peaks  could  be  asterisked. 

The  two  Kovats  index  values  for  each  peak  partially 
characterize  the  type  of  the  compound.  The  most  convenient 
identification  guide  line  is  through  calculation  of  a  difference 
or  dispersion  parameter* 

(K.I.  in  Carbowax  20M)  -  (K.I.  in  Apiezon  L) 

This  parameter  describes  the  gas  chromatographic  "polarity"  with 
respect  to  Carbowax  20M  stationary  phase.  The  parameter  is 
zero,  by  definition,  for  n-alkanes;  can  be  a  small  number  for 
branched  alkanes;  and  increases  by  aporoximately  70  units  by  presence 
of  a  double  bond  in  the  hydrocarbon.  The  difference  parameter 
is  given  in  the  corresponding  five  columns  of  Table  XVIII ,  where  it 
is  classified  into  five  ranges. 

The  first  range,  0  to  100,  contains  alkanes  and  alkenes.  The 
second  range,  100  to  250,  may  contain  dialkenes,  ethers,  acetals. 
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.substances  with  heteroatoms  in  saturated  or  one- double-bond  ring. 

The  third  range  250  to  380.  is  populated  by  alkanals  (aldehydes) , 

ketones,  esters.  It  also  can  contain  benzene  and  its  alkyl 

derivatives:  however,  benzene  has  a  Kovats  index  pair  (in  Car.bowax 

20M/Apiezon  I.)  960/680:  toluene  1060/796;  o-xylene,  1213/926; 

m-xylene ,  1173/901;  p  xylene,  1166/899;  and  ethylbenzene  1152/884; 

none  of  these  is  represented  within  error  limits  of  order  of 

+10  K.l. ,  in  the  table  and  hence  most  likely  are  not  present  among 

the  over  150  compounds  shown.  Amines  also  can  be  present  in  this 

range.  The  fourth  range,  380-520,  contains  alcanols ,  lower 

alkenals,  and  alkadienals.  The  fifth  range,  above  the  dispersion 

parameter  of  520 ,  can  contain  substances  with  two  polar  groups  on 

the  molecule,  or  with  several  double  bonds  on  alcanol  molecules  - 

The  lowest  member  of  glycol  series,  ethyleneglycol,  has  the  index 

pair  of  1580/660,  and  is  not  present  in  Table  XVIII. 

The  indicated  range  boundaries  are  only  approximate,  expecially 

at  the  lower  end  of  K.l.  values.  This  can  be  illustrated  by  the 

follow'ing  examples.  The  n- alkanals  are  located  at  discrete  K.l. 

pair  values  separated  by  approximately  100  unit  steps  between  the 

successive  members  of  the  series:  n-pentanal  985/685,  n-hexanal 

1078/  783  n-heptanal  1188/867  (assumed,  m  Table,  1220/865)  ,  with 

the  dispersion  parameter  of  300.  ,  and  321,  respectively.  If 

the  polar  functional  group  is  partially  shielded  by  non-pol ar  side 

chains,  the  molecule  behaves  in  less  polar  manner;  thus 

n-butanal  =  326 

2-methyl  butanal  291 

n-hexanal  295 

2-ethylhexanal  271 

Double  bonds,  on  the  other  hand,  increase  the  gas  chromatographic 
polarity,  cf 


2-ethyl  hexanal 

271 

2-ethyl- 2-hexenal 

341 

n-butanal 

326 

n-butenal  (crotonaldehyde) 

433 

2 . 4-hexadienal 

509 
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The  index  pairs,  of  course,  also  a’-o  located  at  a  point 

different  from  the  corresponding  n-alkanal  points.  .Similar  rules 

apply  to  ketones  and  esters: 

Butyl  Butyrate  -  =  287 

Isobutyl  Butyrate  (polar  locus  shielded)  233 

Ethyl  butyrate  285 

Vinyl  butyrate  304 

Propyl  propionate  286 

Isopropyl  propionate  270 

Propyl  acrylate  328 

Allyl  acrylate  392 

Similarly,  for  n-alkanols  the  dispersion  parameter  is  in 

the  range  of  480-510  (except  for  the  lowest  members  of  the  series) 

decreases  if  side  alkyl  groups  obstruct  the  hydroxyl  group. and 

increases  if  double  bonds  are  introduced. 

Thus,  although  the  grouping  by  ranges  in  Table  XVIII  is  not 

ideal,  it  gives  insight  into  the  nature  of  the  components,  with 

the  knowledge  of  the  Kovats  index  in  Apiezon  L  ana  of  the 

approximate  gas  chromatographic  polarity,  estimates  of  the  boiling 

points  of  the  components  are  also  possible  and  can  be  considered 

to  be  within  +  iO °C  of  the  actual  value.  The  estimated  boiling 

points  of  the  components  are  also  tabulated  in  Table  XVIII. 

An  absolutely  positive  identification  of  the  specific 

components  from  the  two  gas  chromatographic  parameters  only  is 

not  an  exact  procedure;  it  merely  eliminates  many  unlikely  choices 

narrowing  the  selection  to  a  few  possible  candidates.  Pending 

the  mass  spectrographic  identification,  a  closer  chemical 

characterization  of  the  over  150  components  was  not  conducted. 

Some  points  fitted  reasonably  well  certain  compounds,  however; 

these  are  indicated  in  parentheses  as  the  most  likely  choices 

in  the  last  column. 

A  survey  of  Table  XVIII  invites  several  interesting 
genera lizat ions . 

First  ,  there  is  an  unexpectedly  frequent  presence  of  alkanes 
and  alkenes,  with  eight  to  fourteen  carbon  atoms,  and  with 
alkane  either  not  represented  or  present  at  low  concentrations. 
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This  can  be  either  a  physiol. ogical  effect,  or  an  effect  that 
characterizes  urban  culture  with  vehicular  exhausts  containing 
unburned  hydrocarbons  prt sent  in  air  and  pickru  by  human  subjects. 
There  are  35  components  of  this  typer  9  of  these  are  among  the 
major  36  daughter-components.  sore  negative  disp^r^ion  parameters 
that  show  up  in  the  tabie  are  inherent  in  the  peak-reinjection 
procedure  where  the  injection  into  the  second  column  is  considered 
to  occur  at  the  middle  of  the  finite  duration  injection  interval, 
(e.g.,  30  sec)  although  actually  some  components  are  present  in 
the  leading  edge,  others  in  the  trailing  edge  of  the  parent 
peak.  An  alkane  contained  in  the  trailing  edge  of  the  injected 
parent  p .  ak  wi’ll  result  in  a  small  negative  dispersion  parameter  . 

The  carbonyl  compounds  (including  esters) ,  with  amines  also 
possible,  occur  in  the  third  dispersion  index  column,  with 
possible  additional  cress  over  into  higher  and  lower  ranges. 

Here,  62  components  occur,  among  these  19  are  major  components 
of  their  respective  parent  peaks. 

The  second  dispersion  index  column  contains  29  components, 
but  only  2  of  these  appear  to  be  major  components  of  the  respective 
parent  peaks. 

The  fourth  dispersion  index  column  *alkar,als,  alkadienals) 
contain  22  components;  5  of  these  are  major. 

The  fifth  gioup  contains  i0  components,  one  of  these  is  a 
major  contributor  to  the  corresponding  parent  peak.  These  are 
multifunctional  substances  or  polyenes  with  polar  functional 
groups,  and  their  presence  may  deserve  further  exploration  since 
they  begin  to  appear  in  the  higher  boiling  point  ranges, 
repeatedly  indicated  to  be  important  in  discriminant  functions. 
Also,  compounds  with  lesser  volatility  can  be  recovered  from,  air 
\it'.  higher  efficiencies  and  perhaps  more  suitable  in  detection 
devices. 

To  summarize  the  inspr  c  *  t  on  of  Table  XVII I  ,  relatively  '.igh 
frequency  of  hydrocarbons,  and  the  dominance  of  carbonyl  compounds 
(lnclucinc  perhaps  esters)  among  the  major  components,  and 
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presence  of  multifunctional  compounds,  or  carbonyl  compounds 
with  more  than  one  double  bond  appear  to  be  the  characteristic 
features  of  the  human  chemica]  signature  studied  by  the  high- 
resolution  two-column  technique.  Approximate  group  character¬ 
ization  of  the  components  is  summarized  as  follows: 


Hydrocarbons 

Acetals,  Ethers,  Heterocyclics 
Carbonyls,  Esters,  Amines 
Alcohols,  Diene  Carbonyls 
Diene  Alcohols,  Polyfunctionals 

Total 


Major 

Minor 

Component  s 
of 

Components 

of 

Parent  Peaks 

Parent  Peaks 

Total 

9 

26 

35 

2 

27 

29 

19 

43 

62 

5 

17 

22 

1 

9 

10 

36 

122 

158 

*A  few  duplications,  for  peaks  close  in  Kovats  Index  values  on 
Carbowax  20M  column,  can  be  possible. 


An  inspection  of  Table  XVIII  also  indicates  that  from  the  25 
parent  peaks  studied,  the  peaks  areas  reasonably  well  represent 
primarily  the  content  of  one  single  daughter  component  in  12  cases 
(one  major  daughter  peak  found  in  each) ,  represent  primarily  the 
content  of  two  daughter  components  in  11  cases,  of  three  daughter 
components  in  3  cases,  and  none  in  one  case.  In  the  latter,  the 
parent  peak  consisted  of  several  daughter  components  none  of  which 
excelled  in  magnitude.  These  observations  should  assist  in 
considering  the  merits  of  single-column  devices  vs.  two-column 
devices  for  pattern  classification  ,  at  the  resolution  levels 
obtainable  with  200-ft  columns.  The  presence  and  the  changes  in 
the  concentrations  of  the  minor  daughter  components  can,  however, 
influence  the  position  of  the  apex  of  the  parent  peak  in  the 
primary  gas  chromatogram  and  under  certain  conditions  influence 
the  pattern  classification  data,  by  shifting  the  peak  assignment 
across  a  range  boundary  into  the  range  of  neighboring  variables. 
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Therefore,  the  significance  of  the  minor  peak  components  in 
pattern  recognition  cannot  be  ignored. 

Aside  from  the  above  25  peaks  eluted  from  the  first  200-ft 
Carbowax  20M  column  and  re-analyzed  in  the  50-ft  Apiezon  L  column, 
the  gas  chromatogram  from  the  first  column  contained  many 
additional  peaks.  Data  obtained  from  vapors  of  the  same  subject, 
a  Caucasian  female,  and  analyzed  at  90°,  120°,  and  150°C  are 
listed  in  Appendix  IX.  There  is  reasonable  similarity  between 
two  analyses  at  150°  (A  and  B)  on  two  subsequent  days  from  the 
same  subject.  A  summary  of  the  number  of  peaks  observed  in 
vapors  is  as  follows: 


Kovats  Index 
Range 

Analysis 
90  120 

Maximum 

Humber  in 

Temperature , °C  Respective 

150(A)  150(B)  Range 

- 

Below  300 

10 

2 

3 

3 

10 

801-1000 

11 

11 

7 

4 

11 

1001-1200 

25 

16 

6 

5 

25 

■A 

1201-1400 

11 

27 

12 

13 

27 

1401-1600 

9 

20 

25 

25 

1601-1800 

2 

25 

21 

25 

7 

3 

7 

TOTAL 

57 

76 

80 

74 

130 

Thus,  prior  to 

the 

subsequent 

split  in  the 

Apiezon  L  column. 

_ 

200-ft  Carbowax  2QM  column. 

used 

i sothermally 

at  three 

different  temperatures,  indicating  the  presence  of  approximately 
130  peaks. 


NT  RESEARCH  INSTITUTE 

62 


4.5 _ Mass  Spectrographic  Identification  of  Human  Signature 

_ Component  s 

The  initial  results  showed  that  the  sizes  of  the  peaks 
of  both  human  signatures  studied  fell  below  the  sensitivity  of 
the  mass  spectrograph. 

The  experiments  on  the  mixture  of  n-butanol  and  heptanal 
showed  that  both  compounds  were  positively  identified  each  time 
their  respective  gas  chromatographic  peak  sizes  matched  those  of 
the  major  and  medium-size  components  of  human  signature. 

The  standard  gas  chromatographic  analysis  of  the  human 
signature  collected  on  the  large  fluidized  bed  showed  a  significant 
increase  in  the  sample  magnitude.  The  size  of  the  sample  found 
was  at  least  ten-fold  that  of  the  sample  usually  collected  on  the 
standard  bed.  Also,  the  peak  distribution  pattern  was  similar  to 
the  pattern  previously  obtained  from  the  same  subject  under  routine 
collection  conditions. 

Mass  spectrographic  analysis  of  human  chemical  signature 
collected  on  the  large  bed  and  fed  directly  from  the  open  tubular 
column  produced  patterns  which  in  most  cases  were  too  complex  for 
immediate  interpretation.  One  analysis  indicated  the  presence  of 
a  compound  (naphthalene)  probably  transferred  to  human  from  moth- 
producing  materials.  The  peak  of  this  compound  was  usually  well 
resolved  as  observed  in  the  respective  gas  chromatographic  analyses 
of  the  sample. 

The  direct  mass  spectrographic  analysis  of  effluent  carrying 
human  samples  collected  on  standard  size  beds  produced  patterns 
which  in  all  cases  studied  were  too  low  above  the  instrument 
background  for  positive  identification. 

The  difficulties  encountered  in  the  mass  spectrographic 
identification  of  human  signature  components  result  mainly  from 
the  lack  of  sample  resolution  rather  than  its  size.  This  is 
supported  by  the  positive  identification  of  the  napht.helene 
(since  its  peak  was  well  resolved) ,  by  the  results  of  experiments 
on  1-butanol  and  heptanal  and  by  recent  frequent  observations  made 
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during  the  analysis  of  human  signatures  on  the  two-column  apparatus. 
The  results  of  these  experiments  indicated  that  most  Carbowax  20M 
peaks  were  multi-component  peaks. 
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5.0  PROGNOSIS  FOR  CHEMICAL  SIGNATURE  DISCRIMINATION  DEVICES 

The  work  conducted  indicated  that  the  complex  chemical 
signatures  of  humans  and  environments  contain  features  which 
permit  classification  by  source  type  using  relatively  straight¬ 
forward  statistical  discriminant  techniques.  The  number  of 
features  that  must  be  taken  in  account  and  their  distribution 
within  the  gas  chromatographic  spectrum  depends  on  the  type  of 
sources  that  must  be  discriminated.  Although  it  is  possible  that 
further  research  may  uncover  some  specific  substances  or  a  few 
substances  that  characterize  e.g.,  humans  versus  environments, 
it  is  questionable  if  signature  detection  and  identification 
devices  should  be  based  entirely  on  such  features.  A  device  that 
can  be  based  on  sufficient  redundance  of  features  is  less 
likely  to  suffer  from  interferences.  From  the  findings  thus  far, 
it  appears  that  7  to  15  complex  features  may  be  necessary  to 
reasonably  characterize  the  source.  The  two-column  data  also 
indicate  that  these  features  consist  of  several  chemical 
components  some  of  which  probably  represent  useful  features 
while  others  act  only  as  chemical  noise. 

Two  questions  can  be  posed:  (1)  which  processes  are  best 
suited  for  signature  identification  and  (2)  can  a  sufficiently 
rapid  device  be  envisioned? 

Since  the  useful  features  are  represented  by  various  molecular 
species,  it  would  be  desirable  to  use  vapor  separation  methods 
that  permit  retention  of  the  molecular  identities  rather  than  other 
processes  (such  as  mass  spectrometry  on  non-separated  mixtures) 
which  splitter  the  molecules.  Such  methods  are  the  basis  of  gas 
chromatography.  Separation  by  interaction  with  a  polar  medium,  as 
used  in  the  present  work,  permits  locating  the  discriminating 
features.  It  appears  that  a  useful  discriminant  system  can  be 
based  on  the  principle  of  preseparation  in  polar  columns  and 
inspection  of  the  discriminating  regions  in  the  second  column  or 
columns,  cf.  *.4.  Vertical  cuts  through  e.g.,  Carbowax  20M  versus 
Apiezon  L  field  plot  in  the  regions  of  interest  should  produce 
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enough  information  for  identification  of  source  types. 

The  second  question,  on  the  order  of  speed  that  may  be 
achieved  by  such  devices,  must  be  answered  in  two  steps: 
can  a  preconcentration  of  organic  vapors  from  air  be  conducted 
fact  enough,  and  can  the  subsequent  task  of  identification 
of  the  features  be  conducted  fast  enough? 

Concentration .  Although  higher  sensitivities  to  all  or  to 
some  substances  can  be  achieved  with  some  gas  chromatographic 
detectors  especially  if  helped  by  very  careful  work  and 
adjustments,  nanogram  (10-9  g)  amounts  of  individual  substances 
of  all  types  can  be  relatively  easily  detected  by  devices  widely 
used  in  routine  gas  chromatographic  practice.  For  a  typical 
substance  with  a  molecular  weight  of  100,  this  corresponds  to 
6  X  1012  molecules.  Assuming  the  desired  detectability  level  at 
109  molecules/cm3  of  air  and  the  maximum  gas-sample  size  into  the 
device's  processing  channels  at  1  cm3,  a  concentration  factor  of 
6000  is  needed  to  bring  the  lowest  concentrations  into  the 
detector's  performance  range.  Solubility  of  vapors  of  organic 
materials  in  gas  chromatographic  stationary  phases  decreases  by 
a  factor  of  such  magnitude  when  temperature  is  changed  from 
ambient  to  above  200  to  250 °C.  Hence,  equilibrium  absorption  from 
air  at  ambient  temperatures  and  desorption  at  above  200 °C,  in 
principle,  permits  one-step  concentration  of  the  dilute  chemical 
signal  to  the  level  needed  for  a  routine  detection  by  usual  gas 
chromatographic  detectors. 

There  is  no  limitation,  in  principle,  why  such  a  process 
cannot  be  conducted  with  speeds  of  order  of  1  sec.  A  thermodynamic 
requirement  and  a  kinetic  requirement  must  be  met. 

(k)  (ng)  (v)  >  6  x  1012  (I) 
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where 


k  is  the  dimensionless  partition  coefficient  for  the  vapor 
in  the  stationary-phase/air  system,  equivalent  to  the 
factor  by  which  the  concen* rat  ion  of  the  vapor  molecules 
in  the  solution  in  the  phase  is  larger  than  their 
concentration  in  air  when  equilibrium  is  reached 

na  is  the  concentration,  in  molecules/cm"*,  of  the  vapor  in  air 

v  is  the  volume,  in  cm^  of  the  stationary  phase,  in  the 
concentrating  device 

x  is  the  thickness  of  the  absorbing  film  of  the  stationary 
phase  in  the  device,  cm 

D  is  the  diffusion  constant  of  the  vapor  molecules  in  the 
stationary  phase,  cm.2  sec  -1 

Partition  coefficients  at  ambient  temperature  for  vapors  of 
substances  with  ambient  saturated  vapor  pressures  of  several 
millimeters  or  lower  in  appropriate  stationary  phases  easily 
reach  into  the  range  of  10^  or  higher.  Therefore,  the  first 
requirement  satisfying  the  sensitivity  of  the  detectors  can  be 
met,  for  nQ  =  109 ,  with  a  few  cubic  centimeters  of  a  suitable 
stationary  phase.  The  second  requirement,  from  the  diffusion 
laws,  must  be  met  to  perform  absorption  equilibration  in  a  time 
span  of  order  of  1  sec.  With  a  typical  D  =  10~®,  the  permissible 
film  thickness  is  of  order  of  10”^  cm,  well  within  technologically 
feasible  limits. 

Resolution.  The  complete  resolution  of  the  chemical  signal 
captured  from  air  would  consist  of  a  full  inventory  of  the 
molecular  species  present  in  air  at  the  time  of  inspection  by  the 
analog.  As  in  any  process,  compromise  must  be  accepted  between 
resolution  and  time.  Resolution  is  possible  at  two  stages  of 
processing  —  in  sending  the  vapor  mixture  that  constitutes  the 
signal  with  assistance  of  a  carrier  gas  through  a  gas 
chromatographic  partition  tube, and  in  using  species-selective 
detectors  on  the  signals  resolved  in  the  first  stage.  Most 
detectors  and  their  electronic  circuitry  have  very  fast  response, 
and  their  specificity  cannot  be  improved  by  simply  allowing  more 
time.  Therefore,  signal  processing  time  and  resolution  depend 
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primarily  on  partition-tube  performance. 

The  resolution  that  can  be  produced  per  second  can  be 
estimated  from  the  following  equation*: 


(r 


23 


1) 


_ X _ 

(l+k)2 


where 

R23  is  the  resolution  of  the  vapors  of  substances  2  and  3 
(R23  =  1  indicates  very  distinct  resolution  with  a 
negligible  overlap  of  signals) 

r23  is  the  ratio  of  partition  coefficients  of  the  vapors 
2  and  3  in  the  stationary  phase  at  resolution 
temperature 

t  is  the  time  in  seconds 

k  is  the  retardation  ratio  in  the  partitioning  tube  and 
depends  on  the  volume  of  gas  space,  the  stationary 
phase  in  the  tube,  and  the  partition  coefficient  of  the 
vapor  in  the  stationary  phase 

N  is  the  equivalent  plate  number  cf  the  partitioning  tube 

t  is  the  gas  holdup  time  (volumetric  flow  rate  of  carrier 
gas  through  the  tube  divided  by  gas  phase  volume) 


We  now  use  this  equation  to  estimate  what  ratio,  r23, 
partition  coefficients  of  2  and  3  can  be  ascertained  in  1  sec 
(R2</t  =  1)  in  a  typical  short  partition  tube.  For  capillary 
tubes,  k  is  of  order  of  0.2  to  20,  and  a  10-cm-long  tube  can 
typically  be  equivalent  to  N=100.  Selecting  k=l ,  and  tm=0.1t, 
r23=l  04.  A  difference  of  4%  in  the  partition  coefficients  of 
2  and  3  should  suffice  to  tell  these  components  apart  within  1  sec. 
Other  estimates**  show  that  a  "100-plate"  partition  tube  can 
permit  recovery  of  substance  2  with  less  than  5%  impurity  of  the 
substance  3  and,  in  reverse,  if  both  are  present  in  equal  amounts 


*Loyd,  R.J.  ,  Ayers,  B.O. ,  and  Karasek,  F.W.  ,  Anal.  Chem. ,  32 , 
p.  43 ,  698  (1960)  . 

**Littlewood ,  A.  B.  ,  Gas  Chromatography,  p.  136,  Acad.  Press, 
New  /ork,  1962. 
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in  the  original  mixture  and  their  partition  coefficients  differ  by 
40%  or  more. 

In  the  case  of  very  complex  vapor  mixtures,  many  compounds 
are  present,  and  the  above  resolution  is  inusfficient  for  a 
complete  inventory.  There  is  a  continuous  overlapping  of  the 
substances  with  small  differences  in  the  retention  times.  Ir. 
usual  cases,  low-resolution  processing  then  results  in  a  poorly 
differentiated  effluent  from  the  partition  tube  with  peaks  of 
noticeable  size  concentrations  considerably  in  excess  of  the  total 
concentration  of  other  poorly  differentiated  substances.  Any 
portion  of  the  effluent  from  the  above  "100-plate"  partition  column, 
taken  during  the  time  interval  of  t  +  0.4t,  contains  over  95%  of 
the  component  or  components  with  the  retention  time  t,  and  a  lesser 
fraction  of  components  with  retention  times  within  the  t  +  0.4t 
boundary. 

In  the  notation  of  Kovats  retention  index  (cf.  Section  3.6.5), 
the  retention  times  are  referred  to  the  scale  of  n-alkanes  with  a 
logarithmic  transformation  to  produce  intervals  of  100  Kovats 
Index  units  between  each  of  two  consecutive  members  of  the 
homologous  series  of  n-alkanes.  Two  compounds  with  retention 
times  of  t  and  t  +  0.4t  produce  an  index  difference  of  50  K.I. 
units.  The  interval  between  C2  (200  K.I.)  and  C20  (2000  K.I.) 
alkanes  carries  36  intervals  of  50  K.I.  units  each.  Thus,  in 
principle,  a  parallel  multichannel  unit  with  separate  non-polar 
phase  channels  operated  at  different  processing  regimes  (temperatures, 
flow  rates,  and  times)  can  split  the  Kovats  Index  continuum 
corresponding  to  compounds  with  boiling  points  up  to  400 °C  into  30 
to  40  cells.  Each  cell  is  populated  by  molecular  species  with 
relatively  similar  partition  coefficients  in  the  gas/non-polar  phase 
system.  The  detector  response  at  the  exit  of  each  low- resolution 
channel  is  a  complex  function  of  the  population  distribution  within 
the  cell  with  frequency  maxima  at  those  time  points  at  which  some 
more-than-usually  abundant  molecular  species  elute. 
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Various  discriminators  can  then  be  applied  to  the  population 
of  each  of  the  cells.  Two  types  of  discriminators  can  be  derived 
from  gas  chromatographic  practice.  One  is  a  polar  or  otherwise 
specific  stationary  chase  in  a  low-resolution  column  with  its 
characteristics  being  the  shift  of  the  elution  positions  with 
respect  to  the  n-alkane  position.  The  other  is  the  sensitivity 
of  the  detector  to  some  atomic  species,  e.g. ,  a  sulfur-sensitive 
detector  indicates  the  distribution  of  sulfur  compounds  among 
the  cells  and  within  each  cell.  The  species-sensitive  detector 
can  apply  to  the  initial  multichannel  signals  or  to  the  signals 
first  distorted  by  the  second-stage  discriminator  columns.  Since 
the  dispersion  shifts  easily  exceed  50  K.I.  units,  a  low-resolution 
discriminator  is  sufficient  to  produce  a  rather  informative 
dispersed  signal.  The  shifts  are  characteristic  of  functional 
groups  in  the  molecule  and  of  the  steric  clustering  of  non-polar 
parts  of  the  molecule  around  the  functional  group. 

Thus,  there  is  no  fundamental  difficulty  in  development  of 
processes  that  can  characterize  chemical  signatures  of  humans  within 
a  few  seconds  to  a  degree  sufficient  for  classification  by  sources. 
The  actual  choice  of  the  processing  steps  requires  however  a 
massive  experimental  and  theoretical  effort. 
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6.0  SUMMARY  AND  CONCLUSIONS 


Within  the  context  of  the  present  work,  chemical  signatures 
are  defined  as  compositional  characteristics  of  organic  vapors 
emanated  into  air  by  various  sources.  The  objective  of  the 
conducted  work  was  to  estatlish  if  features  exist  in  the  chemical 
signatures  of  humans  that  can  be  used  to  distinguish  humans  from 
environments  and  to  distinguish  among  humans.  A  further  objective 
was  to  characterize  those  features  as  far  as  possible  within  the 
limits  of  the  effort  and  presently  available  technology.  An 
all-glass  Teflon  cell  was  available  from  the  previous  project 
(DA-11-022-AMC-1775 (X) ) .  This  cell  permitted  isolating  humans 
temporarily,  from  the  surrounding  air  and  maintain  them  in  a 
flowing,  highly  purified  air  stream  so  that  better  samples,  not 
confused  by  a  continuous  arrival  of  contaminants  from  air,  could 
be  collected. 

It  was  soon  discovered  that  many  cf  the  usual  sample 
collection  procedures,  such  as;  collection  in  vessels,  cryogenic 
cooling  of  air,  or  adsorption  on  carbon,  were  unsuitable  for  the 
chemical  signature  acquisition  when  the  chemical  signatures 
consisted  of  a  large  variety  of  organic  materials  available  in 
air  at  concentrations  as  low  as  parts  per  billion. 

Consequently,  considerable  effort  was  directed  toward  the 
development  of  improved  techniques  for  the  acquisition  and 
processing  of  samnles  of  chemical  signatures  from  humans  and 
various  environments.  The  essential  features  of  this  development 
involved  new  systems  for  sample  collection,  transfer,  and 
injection  into  the  g*s  chromatograph.  The  sample  acquisition  was 
based  on  the  concept  of  collection  through  equilibration.  The 
collectors  were  special  fluidized  beds  of  Teflon  powder  coated 
with  a  thin  film  of  Apiezon  L  stationary  phase.  The  fluidized 
bed  configuration  of  the  collector  produced  the  best  possible 
contact  of  samples  with  air.  The  Apiezon  L  permitted  later 
recovery  of  the  dissolved  sample  with  minimum  artifacts  from 
decomposition  of  the  trapping  phase  itself.  In  a  typical 
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collection  of  a  sample  from  a  human,  the  bed  material  was 
exposed  to  air  carrying  the  vapors  at  the  flow  rate  of  0.75  liters 
per  sec.  The  environment  chemical  signatures  were  collected  by 
pulling  the  ait  through  the  sampler  by  a  small  blower.  Since  the 
collection  involved  equilibration,  the  actual  airflow  rate  through 
the  sampler  did  not  have  to  be  known  and  was  not  critical  as  long 
as  the  powder  was  kept  in  the  fluidized  state. 

A  special  calibration  procedure  was  devised  to  correlate  the 
concentration  of  a  known  sample  component  in  air  directly  to  the 
size  of  the  peak  on  the  gas  chromatogram,  to  check  the  collection 
efficiency  and  to  establish  the  time  required  for  attainment  of 
equilibrium  concentrations  for  various  types  of  compounds. 

The  recovery  of  the  collected  sample  from  the  trapping  film 
and  its  injection  into  the  gas  chromatograph  for  analysis  was 
effected  by  a  special  sample  transfer  and  injection  system.  The 
sample  transfer  system  was  designed  to  allow  the  transfer  of  the 
vapors  from  the  bed  maintained  at  80 °C  into  a  special  stainless 
steel  thin  wall  needle  which  was  kept  at  liquid  nitrogen 
temperature.  The  transfer  was  effected  with  cryogenically 
purified  gas.  The  sample  injection  system  was  constructed  to 
deliver  the  sample  from  the  injector  needle  to  the  gas 
chromatographic  column  in  a  timed,  automatic  and  reproducible 
manner. 

The  sample  acquisition  step  was  found  to  require  45  min  to 
produce  reproducible  and  equilibrated  samples.  The  typical 
duration  of  the  sample  transfer  step  was  6  min.  During  this 
period  70  to  80  percent  of  the  vapor  sample  was  found  in  the 
injector  needle.  Higher  efficiency  could  be  achieved  with  a 
longer  transfer  time  but  at  the  expense  of  increased  analytical 
time.  The  sample  injection  step  resulted  in  reproducible  and 
practically  quantitative  injections  with  95  to  98  percent  of  the 
sample  transferred  from  the  needle  and  introduced  into  the 
injection  port  of  the  gas  chromatograph. 

The  standardized  techniques  developed  in  the  initial  phase 
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of  this  work  ware  used  to  obtain  65  gas  chromatographic  patterns 
of  human  signatures  and  10  patterns  of  various  environments.  The 
human  signatures  were  collected  from  healthy  subjects  of  both 
sexes  and  several  races.  The  races  included  American  Caucasian 
males  and  females,  Indian  males  (born  in  India' ,  a  few  American 
Negroes,  and  a  Mexican  and  a  Malayan  subject. 

The  first  series  of  chemical  signatures  were  analyzed  at 
125 °C  using  a  10-ft,  1/3-in  O.D.  packed  bed  columns  with  a 
Carbcwax  20M  stationary  phase.  Later  work,  conducted  on  a 
smaller  variety  of  samples  from  humans,  utilized  50  and  200-ft 
long  suppcrt-coated  open  tubular  columns  with  a  Carbowax  20M 
costing.  These  provided  much  higher  resolution  and  were  used 
isothermally  at  several  temperature  levels  (90°,  125°,  and  150°C). 

In  still  later  work,  a  specially  modified  two-column  gas 
chromatograph  was  used  in  which  the  sample  was  first  analyzed  in 
a  200-ft  support-coated  open  tubular  Carbowax  20M  column  and  the 
gas  chromatographic  peaks  found  previously  to  occur  most 
frequently  in  the  human  chemical  signatures  were  further  analyzed 
in  a  50-ft  support-coated  open  tubular  A.piezon  L  column.  This 
procedure  allowed  the  resolution  of  peaks  into  further  components 
and  a  subsequent  estimation  of  their  gas  chromatographic  polarities. 
A  three-column  gas  chromatograph  was  also  used,  combined  with  a 
computer-analysis  of  data,  to  characterize  the  con  >onents  of  the 
chemical  signatures  in  similar  terms. 

The  great  complexity  of  the  chemical  signatures  was  revealed 
by  the  experimental  survey  of  the  samples  from  65  humans,  and  10 
environments  in  analysis  in  the  10-ft  packed  columns.  At  this 
stage,  the  data  were  studied  by  several  computerized  statistical 
analysis  techniques,  to  discover  if:  (1)  valid  discriminant 
functions  can  be  extracted  permitting  classification  of  the 
signatures  by  the  types  of  their  sources,  (2)  how  many  gas 
chromatographic  features  need  to  be  used  to  obtain  classifications, 
and  (3)  indicate  those  features  which  are  most  significant  in 
forming  discriminant  functions  and  therefore  deserve  priority  in 
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their  chemical  identification. 

Attempts  at  mass  spectrographic  or  other  identification  of 
components  at  various  degrees  of  gas  chromatographic  resolution 
were  also  made.  These  indicated  that  the  highest  degrees  of 
resolution  in  a  two-column  system  will  be  needed  to  resolve  the 
chemical  species  present  in  the  chemical  signatures  of  humans  to 
a  degree  where  meaningful  identifications  of  the  individual 
components  becomes  possible.  This  work  remains  an  objective  for 
the  future,  and  in  many  cases  will  require  further  enrichment  of 
the  sample  to  reach  concentrations  sufficient  for  identification. 

The  data  were  organized  in  a  Kovats  Index  system  and  are 
tabulated  in  the  Appendix. 

The  following  conclusions  were  reached: 

(1)  At  the  resolution  available  isothermally  with  10-ft 
packed  gas  chromatographic  columns  and  using  collection 
and  sample  processing  methods  to  ppb  of  organic  species 
in  air,  the  human  chemical  signatures  exhibited  from 

32  to  48  features  per  signature. 

(2)  Statistical  pattern  analysis  conducted  on  the  chemical 
signatures  at  the  above  resolution  levels  indicated  that 
from  2  to  15  gas  chromatographic  variables  are  needed 

to  reach  meaningful  classification  of  the  signatures 
by  their  sources.  The  number  of  variables  (features) 
that  must  be  utilized  increases  as  the  similarity  of 
sources  (e.g.  Caucasian  males  vs.  females)  or 
variability  within  classes  of  sources  (e.g.  males  of 
several  races  vs.  females  of  several  races)  increases. 

(3)  Some  classifications  (e.g.  humans  vs.  environments) 
based  on  classical  stepwise  discriminant  analyses  of 
patterns  resulted  in  discriminant  function  which 
reasonably  successfully  classified  additional  test 
cases. 

(4)  There  were  indications  that  the  decision  boundaries  for 
the  chemical  signature  classifications  in  statistical 
hyperspaces  may  be  more  complex  than  accessible  to  the 
classical  stepwise  discriminant  procedures. 

(5)  Detection  and  source  discrimination  devices  based  on  the 
principle  of  chemical  signatures  must  most  probably  be 
based  on  detection  and  correlation  of  5  to  15  features 
at  resoJution  levels  outlined  in  the  conclusion  of 
paragraph  #1. 
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(6)  Chromatographic  analysis  with  resolution  obtainable 
isothermally  at  several  temperature  levels  from  200-ft 
support-coated  Carbowax  20M  column  exhibited  typically 
100  to  130  peaks.  Twenty-five  of  these,  reanalyzed 
isothermally  on  50-ft  support-coated  Apiezon  L  column 
separated  further  into  3  to  10  chemical  components, 
thus  producing  over  150  observable  components.  The 
total  number  of  components  observable  at  this  level 

of  two-stage  resolution  in  the  chemical  signature  of 
a  selected  human,  can  be  estimated  to  be  in  the  range 
between  500  to  1000. 

(7)  Classification  of  the  observed  components,  by  their 
gas  chromatographic  polarity,  with  attention  to  those 
components  most  frequently  observed  in  humans,  indicated 
the  presence  of  considerable  amounts  of  hydrocarbons, 
carbonyl  compounds  and  a  few  alcohols.  Compounds 
considerably  more  polar  than  alcohols  were  also 
observed  and  probably  belong  to  species  with  more  than 
two  polar  groups  per  molecule,  or  contain  more  than 

one  double  bond,  in  addition  to  a  polar  group. 

(8)  Mass  spectrographic  identification  succeeded  only  on 
two  compounds  which  originate  most  probably  from 
moth-proofing  materials  and  therefore  represent  features 
characteristic  of  advanced  material  culture.  The 
surprising  f requency-of-occurrence  of  hydrocarbons  may 
be  related  to  a  similar  environment  permeated  by 
vehicular  exhaust.  Future  mass  spectrographic 
identification  should  be  attempted  only  on  components 
after  their  resolution  in  two-column  system. 
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Table  XL 

CONTRIBUTIONS  TO  EFFECTIVE  CARBON  NUM3ER 
IN  HYDROGEN  FLAME  IONIZATION  DETECTOR 


Aliphatic  C 
Aromatic  C 
Olefinic  C 
Acetylenic  C 
Ketone,  Aldehyde  C 
Nitrile  C 
Ether  0 

Primary  Alcohol  0 
Secondary  Alcohol  O 
Tertiary  Alcohol  0 
Ester  0 

Two  or  more  Cl  on  Single 
Aliphatic  C 
Cl  on  Olefinic  C 
Amine  N 


1.0 
1.0 
0.05 
1 . 30 
0 

0.3 

-1.0 

-0.6 

-0.75 

-0.25 

-0.25 

-0.12  each  Cl 
+0.C5 

as  0  in  corresponding 
alcohol s 


Table  XII 

n- BUTANOL  EMISSION  FROM  VARIOUS  HUMANS 


2  Rate 

Subject _ Peak  Area .  cin  Molecules/Sec 

CAUCASIAN  MALE 


A 

2.00 

0.62  X  10 

1,60 

0,49 

3.20 

0.99 

Ba 

7.78 

2.42 

5.52 

1.71 

3.28 

1.02 

0.53 

0.16 

y. 

1,76 

0.55 

cb 

48.00 

14.90 

CAUCASIAN  FEMALE 

D 

11,64 

3.61 

E 

6,08 

1.89 

F 

46.30 

14.40 

INDIAN  MALE 

/“i 

3.60 

1.12 

H 

4.13 

1.28 

I 

3.84 

1.19 

JC 

14.30 

4,43 

4.00 

1,24 

19.88 

6.16 

K 

4.64 

1.44 

L 

5.04 

1.56 

M 

9.92 

3.07 

N 

1.28 

0.40 

0 

3.50 

1.08 

P 

2.08 

0.65 

NEGRO  MALE 

Q 

5.28 

1.64 

NEGRO  FEMALE 

R 

60.48 

18.75 

S 

56.40 

aFirst  3  measurements  taken  when  subject  was  on 
average  mixed  diet?  4th  and  5th  measurements  taken 
7  and  8  days  later,  respectively,  after  subject  had 
been  on  special  diet  of  fish,  cheese,  milk  and  tea, 

y. 

“subject  is  a  chemist  working  with  various  organic 
solvents, 

“subject  is  a  strict  vegetarian;  other  Indian  subjects 
mostly  vegetarians. 


Table  XIII 


DISCRIMINANT  ANALYSIS  OF  CHEMICAL  SIGNATURES 
OF  MALES,  FEMALES,  AND  AIR  FROM  VARIOUS  ENVIRONMENTS 


Variables 

Consecutive 

Number* 

Considered 
Corresponding 
Kovats  Index 
Range 

Number  of  Cases  Considered 

Males  , 
Cases 

39  Females , 
Correctly  Cla 

2_6  Air ,  10 
ssif ied  ,  % 

1 

1250-1300 

77 

34 

70 

2 

1350-1400 

77 

38 

60 

3 

1450-1500 

82 

51 

90 

4 

1050-1100 

84 

48 

90 

5 

650-  700 

84 

62 

90 

6 

>  1600 

84 

62 

90 

15  850-  900  87  79  90 


★ 

In  order  selected  by  the  computer 


At  15  variables  considered,  the  following  Kovats  Index  ranges 
have  not  been  selected  by  the  computer  as  useful  discriminant 
700-750;  800-850;  950-1000;  1000-1050;  1150-1200;  1400-1450; 
1550-1600. 
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Table  XIV 


DISCRIMINANT  ANALYSIS  OF  CHEMICAL  SIGNATURES 
OF  CAUCASIAN  MALES.  INDIAN  MALES,  AND  CAUCASIAN  FEMALES 


Variables 

Considered 

Number 

of  Cases 

Considered 

Corresponding 

Caucasian 

Indian 

Caucasian 

Consecut ive 

Kovats  Index 

Males  .  2  5 

Males  , 

12  Females.  23 

Number* 

Ranae 

Cases  Correct ly 

Classified,  % 

1 

1250-1300 

60 

41 

61 

2 

700-  750 

56 

75 

73 

3 

1350-1400 

64 

75 

69 

4 

650-  700 

64 

75 

69 

5 

1550-1600 

72 

75 

69 

13 

1600 

83 

83 

80 

In  order  selected  by  the  computer 

At  13  variables  considered,  the  following  Kovats  Index  ranges 
have  not  been  selected  by  the  computer  as  useful  discriminants 
<  600:  600-650;  850-900;  900-950;  1150-1200;  1200-1250; 
1300-1350;  1400-1450;  1500-1550. 
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Table  XV 


LIST  OF  VARIABLES  (K.I.  RANGES) 

(Those  ranges  in  which  peaks  most  frequently 
occurred  in  vapor  from  humans  are  marked  P) 


3 


1 


i 


Consecutive  Number  Variables 

of  Range,  K.I.  Range  Range  Width,  Selected  for  Latest 

_ (Variable) _  From-To  K.I.  Unit  s  Class  Separation? 


1 

0- 

465 

465 

2 

465- 

485 

20  P 

3 

485- 

516 

31 

4 

516- 

536 

20  P 

X  P 

A 

536- 

590 

54 

X 

6 

590- 

610 

20  P 

X  P 

7 

610- 

645 

35 

a 

645- 

665 

20  P 

X  P 

9 

665- 

666 

1 

10 

666- 

686 

20  P 

X  P 

11 

686- 

706 

20  P 

X  P 

12 

706- 

722 

16 

13 

722- 

740 

18  P 

14 

740- 

758 

13  P 

15 

7  58- 

799 

41 

X 

lb 

799- 

819 

20  P 

17 

819- 

824 

5 

X 

18 

824- 

842 

19  P 

X  p 

19 

842- 

860 

18  P 

20 

860- 

38  2 

22 

21 

882- 

902 

20  P 

22 

902- 

904 

2 

X 

23 

904- 

924 

20  P 

X  P 

24 

924- 

968 

44 

25 

968- 

987 

19  P 

26 

987- 

1005 

18  P 

X  P 

27 

1005- 

1025 

20 

X 

28 

1025- 

1045 

20  P 

29 

1045- 

1061 

16 

30 

1061- 

1081 

20  P 

X  P 

31 

1081- 

.1084 

3 

32 

1084- 

1104 

20  P 

33 

1104- 

1112 

a 

X 

34 

1112- 

1132 

20  P 

35 

1132- 

1140 

8 

X 

36 

1140- 

116C 

20  P 

37 

1160- 

1171 

11 

X 

38 

1171- 

1191 

20  P 

39 

1191- 

1 195 

4 

40 

1195- 

1215 

20  P 

41 

1215- 

1232 

17  P 

42 

1232- 

1250 

18  P 

43 

1250- 

1265 

15 

X 

44 

1265- 

1285 

20  P 

45 

1285- 

1297 

12 

46 

1297- 

1315 

18  P 

X  P 

47 

1315- 

1334 

19  P 

X  P 

48 

1334- 

1340 

6 

49 

1340- 

1360 

20  P 

X  P 

50 

1360- 

1372 

12 

X 

51 

1372- 

1392 

20  P 

X  P 

52 

1392- 

1401 

9 

X 

53 

1401- 

1419 

18  P 

54 

1419- 

1437 

17  P 

X  P 

55 

1437- 

1454 

17 

X 

56 

1454- 

1472 

18  P 

X  P 

57 

14,’2- 

1490 

18  P 

X  P 

58 

1490- 

1500 

10 

X 

59 

1500- 

1517 

17  P 

X 

60 

1517- 

1535 

18  P 

X  P 

61 

1535- 

1590 

55 

X 

62 

1590- 

1610 

20  P 

63 

1610- 

1615 

5 

X 

64 

1615- 

1635 

20  P 

65 

1635- 

1800 

165 

X 

1 
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Table  XVI 


SUCCESS  IN  STATISTICAL  PATTERN  CLASSIFICATION 
AS  A  FUNCTION  OF  THE  NUMBER  OF  VARIABLES 
(Variables  Correspond  to  Range  Numbers  in  Table  XV) 


All  37  P-Ranaes  (Run  A) 

10  P-Ranaes 

Onlv  (Run  B) 

Consecutive 
Number ina 

Identification 
Number 
of  Variable 
cf  Table  XV 

%  of 

Human 

Signatures 
Classified  in 
"Human  Class" 

Identification 
Number 
of  Variable 
of  Table  XV 

%  of 

Human 

Signatures 
Classified  in 
"Human  Class" 

1 

64 

71 

64 

71 

2 

56 

78 

56 

78 

3 

57 

83 

57 

83 

4 

47 

83 

26 

87 

5 

46 

83 

44 

91 

6 

26 

86 

7 

18 

86 

8 

10 

no  readout 

9 

30 

88 

10 

8 

no  readout 

11 

16 

94 

Additional 

92  to  94 

92 
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Figure 

Number 


LIST  OF  FIGURES 


Title 


1  Carbon  Bed  for  Removal  of  Residual  Organic  Vapor  from 

Hospital  Grade  Cylinder  Air 

2  Fluidized  Bed  Sampler 

3  Transfer  of  Sample  from  Bed  Sampler  to  Injector 

4  Sample  Injector 

5  Sample  Injection  System 

6  Calibration  System 

7  Portable  Air  Sampling  Apparatus 

8  Two-Column  Apparatus 

9  Secondary  Resolution  of  Two  Human  Signature  Components 
on  Apiezon  L  Column  of  Two- Column  Apparatus 

10  Three-Column  Apparatus 

11  A  Segment  of  a  Typical  Gas  Chromatogram  of  Human 
Chemical  Signature  Obtained  on  Three-Column  Apparatus 

12  Chemical  Logic  Field  for  Classif icat ion  of  Data 
Obtained  from  the  Two-Column  System 

13  Problems  in  Pattern  Similarity  Comparisons 

14  Improvement  in  Correct  Classification  of  Individual 
Chemical  Signatures  with  Increase  in  the  Number  of 
Computer-Selected  Peaks  Employed  in  the  Classification 

15  Discriminant  Functions  Field  for  Humans  vs  Cooking 
Area  Vapors  and  Classification  of  Additional  Humans 
Using  Derived  Functions 
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Derivation  of  Discriminant  Function  for  Classification 
of  Human  and  Environment  Chemical  Signatures  and  Test 
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Signatures 

Improvement  in  Correct  Pattern  Assignments  with  Increase 
in  the  Number  of  Variables  Utilized  by  the  Stepwise 
Discriminant  Analysis  Program 

Derivation  of  Three-Class  Discriminant  Functions  Field 
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Display  of  Two-Column  Data  on  Human  Chemical  Signatures 
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Figure  2 

FLUIDIZED  BED  SAMPLER 
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Figure  3 
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Figure  5 
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Figure  9 

SECONDARY  RESOLUTION  OF  TWO  HUMAN  SIGNATURE  COMPONENTS 
ON  API EZON  L  COLUMN  OF  IWO-COLUMN  APPARATUS 


I 


105 


j-i-h-i  1  • 'pri i ■  wtVi 


flame  ionization  detector 


flame 

IONIZATION 

DETECTORS 


AUXILIARY  HELIUM 


FLOW  RESISTANCE 


4  -ft  X  l/8-m  PACKED 
APlEZON  L  COLUMN 


50  ml/mtn 


40  ml/min 


6-ft.X  I  /8-in  PACKED 
APlEZON  L  COLUMN 


SEPTUM- 


FLOW  REGULATOR 


FLOWMETER- 


HELIUM  -*■ 


5  ml/min 


MANOMETER 


HELIUM'  - 


200 ft  CARBOWAX  20M 
'OPEN  TUBULAR  COLUMN 


INJECTION  PORT 


MANOMETER 
FLOW  REGULATOR 


flowmeter 


Fiyure  10 

THREE  COLUMN  APPARATUS 
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A  SEGMENT  OF  A  TYPICAL  GAS  CHROMATOGRAM  OF 
HUMAN  CHEMICAL  SIGNATURE  OBTAINED  ON  ThREE— COLUMN  APPARATUS 
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Figure  13 

PROBLEMS  IN  PATTERN  SIMILARITY  COMPARISONS 
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PERCENT  OF  MI5CLASSIFIED  CHEMICAL  SIGNATURES 
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NUMBER  OF  VARIABLES  (PEAKS  SELECTED  BY  COMPUTER) 
UTILIZED  IN  CLASSIFICATION 


Figure  14 

IMPROVEMENT  IN  CORRECT  CLASSIFICATION  OF  INDIVIDUAL 
CHEMICAL  SIGNATURES  WITH  INCREASE  IN  THE  NUMBER  OF  COMPUTER- 
SELECTED  PEAKS  EMPLOYED  IN  THE  CLASSIFICATION 

Classification  based  on  49  signatures  arranged  in  5  groups. 
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Figure  16 

STATISTICAL  CLASSIFICATION  PLANE  FOR  VAPOR 
ViATOGRAMS  OF  TWO  TYPES  OF  MALES  AND  FOOD  PREPARATION 


NOTES  TO  FIGURE  17 


Wi lcoxon-White  ranking  test,  applied  to  the  positions  of 
points  in  the  negative  direction  of  X  discriminant  function 
only  yielded  162  as  the  rank  sum  for  females  (rank  sum  for  the 
smallest  group) .  The  critical  point  for  error  probability  of 
0.1  percent  is  252.  Thus,  the  validity  of  the  X  function  in 
classification  of  males  versus  females  has  considerably  less  than 
1:1000  chance  of  being  in  error. 
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Figure  20 

COMPARISON  IN  THE  IMPROVEMENT  OF  TWO-WAY  CLASSIFICATION 
OF  CHEMICAL  SIGNATURES:  USING  VARIABLES  (KOVATS  INDEX  RANGES) 
THAT  FREQUENTLY  CONTAIN  PEAKS  IN  HUMAN  SIGNATURES 
(P-VARIABLES ,  FULL  LINE),  OR  VARIABLES  THAT  CORRESPOND  TO 
RANGES  IN BETWEEN  (DASHED  LINE) 
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Figure  21 

DERIVATION  OF  DISCRIMINANT  FUNCTION  FOR  CLASSIFICATION 
OF  HUMAN  AND  ENVIRONMENT  CHEMICAL  SIGNATURES 
AND  TEST  OF  THE  FUNCTION  ON  29  ADDITIONAL  HUMAN  CHEMICAL  SIGNATURES 
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Figure  22 

IMPROVEMENT  IN  CORRECT  PATTERN  ASSIGNMENTS  WITH 
INCREASE  IN  THE  NUMBER  Of  VARIABLES  UTILIZED  BY  THE 
STEPWISE  DISCRIMINANT  ANALYSIS  PROGRAM 
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Figure  23 

DERIVATION  OF  THREE-CLASS  DISCRIMINANT  FUNCTIONS  FIELD 
(HOLLOW  SYMBOLS)  AND  TEST  OF  THF,  DERIVED  FUNCTIONS  WITH 
ADDITIONAL  MALE  CHEMICAL  SIGNATURE  PATTERNS  (FILLED  TRIANGLES) 
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APPENDIX  I 

Tabulation  of  Chemical  Signature  Data 
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EXPLANATION  OF  ARRANGEMENT 

Human  chemical  signatures  are  arranged  in  numbered  order, 
approximating  the  chronological  order  in  which  the  data  were 
calculated.  The  environment  chemical  signatures  begin  after  the 
human  chemical  signature  No.  69. 

The  tabulation  is  a  copy  of  the  computer  printout. 

The  upper  line  in  each  pattern  describes  the  source, 
("SUBJECT"),  gives  the  consecutive  pattern  number  (one  per 
pattern),  and  initials  if  the  source  was  a  human  subject. 

The  second  line  is  the  data  of  the  chemical  signature 
collection. 

The  third  line  (Total  Intensity)  gives  the  total  area, 
in  sq.  mm.  at  0.1  x  1  electrometer  setting,  of  all  gas 
chromatographic  peaks  of  the  particular  pattern,  and  the  mean 
peak  intensity  (total  area  of  the  pattern's  peaks  divided  by  the 
number  of  peaks  in  the  same  pattern) . 

The  columns  in  the  tabulation  are  as  follows: 

(1)  First  column-  consecutive  number  of  the  peak  in  the 

particular  pattern. 

(2)  Second  column  -  retention  time  in  terms  of  Kovats 

retention  index-cf.  text,  section  3.6.5. 

(3)  Third  column-  the  area  of  the  peak,  in  sq.  mm.  calculated 

to  the  electrometer  sensitivity  setting  of  0.1  x  1. 

(4)  Fourth  column-  normalized  area,  equal  to  the  area  of  the 

particular  peak  divided  by  the  "Total  Intensity1 — 

sum  of  areas  of  all  peaks  in  the  same  pattern. 
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SUMMARY  OF  PATTERNS  BY  CROUPS 
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APPENDIX  II 

Gas  Chromatographi c  Positions  of  Peaks  Obtained  in 
Analysis  of  v'apors  from  the  Same  Caucasian  Female 

Analysis:  Isothermal,  200-ft  support-coated 

open  tubular  column  with  Carbowax  20M. 
Data  are  listed  on  Kovats  Retention 
Index  Scale. 


NOTES 

(1)  For  those  ranges  of  the  chromatograms  which  were  outside 
of  the  calibrated  range,  the  peak  positions  are  given  in 
minutes  of  retention  time  instead  of  Kovats  Indexes. 

(2)  Values  below  K.I.  =  700  correspond  to  the  very  initial 
parts  of  the  gas  chromatograms  and  are  much  less  exact 
(of  order  of  +  30  units  at  extreme  early  range)  than 
the  values  in  the  higher  K.I.  ranges. 

(j)  The  last  column  represents  difference  in  Kovats  Indexes 
between  peaks  and  shoulders  observed  in  Run  B  and  the 
closest  matching  peak  in  Run  A.  A  plus  (+)  mark 
indicates  that  peaks  in  pattern  B  did  not  seem  to  have  a 
reasonable  counterpart  in  the  pattern  A  within  the  limits 
of  experimental  error.  A  dash  (-)  means  that  a  peak  was 
observed  in  Run  A  but  apparently  had  no  satisfactory  match 
in  Run  B. 
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In  the  work  on  the  detection  and  identification  of  human  chemical  signatures, 
experimental  and  computerised  statistical  techniques  were  first  developed,  tested, 
and  standardized.  The  experimental  techniques  solved  the  problems  of  sample 
collection,  recovery,  injection  into  the  gas  chromatograph,  calibration  and 
characterization  of  signature  components.  Ihc  sample  collections  were  based  on  the 
principle  of  equilibration  and  were  performed  by  means  of  Apiezon  L-coated  Teflon 
powder  in  the  form  of  a  fluidized  bed.  The  collected  sauplcs  were  transferred  into 
a  special  injector  needle  by  means  of  an  1 1 TRl-dcsi gned  transfer  system.  The 
injection  Gf  samples  into  the  gas  chromatograph  was  performed  in  a  timed,  automatic 
and  reproducible  manner  with  a  specially  constructed  injection  system.  The 
calibration  method  permitted  correlation  of  the  concentrations  of  known  signature 
components  in  air  with  their  respective  peak  areas.  The  sensitivity  of  this  novel 
process  is  of  the  order  of  one  part  per  billion  (by  volur.e)  of  an  organic  component 
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